2 Relativity and electromagnetism 



2.1 The relativity principle 

Physical phenomena are conventionally described relative to some frame of refer- 
ence which allows us to define fundamental quantities such as position and time. 
Of course, there are very many different ways of choosing a reference frame, but 
it generally convenient to restrict our choice to the set of rigid inertial frames. 
A classical rigid reference frame is the imagined extension of a rigid body. For 
instance, the Earth determines a rigid frame throughout all space, consisting of 
all those points which remain rigidly at rest relative to the Earth and each other. 
We can associate an orthogonal Cartesian coordinate system S with such a frame, 
by choosing three mutually orthogonal planes within it and measuring x, y, and 
z as distances from these planes. A time coordinate must also be defined in order 
that the system can be used to specify events. A rigid frame, endowed with such 
properties, is called a Cartesian frame. The description given above presupposes 
that the underlying geometry of space is Euclidian, which is reasonable provided 
that gravitational effects are negligible (we shall assume that this is the case). 
An inertial frame is a Cartesian frame in which free particles move without ac- 
celeration, in accordance with Newton's first law of motion. There are an infinite 
number of different inertial frames, each moving with some constant velocity with 
respect to a given inertial frame. 

The key to understanding special relativity is Einstein's relativity principle, 
which states that 

All inertial frames are totally equivalent for the performance of all 
physical experiments. 

In other words, it is impossible to perform a physical experiment which differen- 
tiates in any fundamental sense between different inertial frames. By definition, 
Newton's laws of motion take the same form in all inertial frames. Einstein gen- 
eralized this result in his special theory of relativity by asserting that all laws of 
physics take the same form in all inertial frames. 
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Consider a wave-like disturbance. In general, such a disturbance propagates 
at a fixed velocity with respect to the medium in which the disturbance takes 
place. For instance, sound waves (at S.T.P.) propagate at 343 meters per second 
with respect to air. So, in the inertial frame in which air is stationary sound 
waves appear to propagate at 343 meters per second. Sound waves appear to 
propagate at a different velocity in some other inertial frame which is moving 
with respect to the first frame. However, this does not violate the relativity 
principle, since if the air were stationary in the second frame then sound waves 
would appear to propagate at 343 meters per second in this frame as well. In other 
words, exactly the same experiment (e.g., the determination of the speed of sound 
relative to stationary air) performed in two different inertial frames of reference 
yields exactly the same result, in accordance with the relativity principle. 

Consider, now, a wave-like disturbance which is self-regenerating and does not 
require a medium through which to propagate. The most well known example of 
such a disturbance is a light wave. Another example is a gravity wave. According 
to electromagnetic theory the speed of propagation of a light wave through a 
vacuum is 

c = -— = = 2.99729 x 10 8 meters per second, (2.1) 

where eo and are physical constants which can be evaluated by performing 
two simple experiments which involve measuring the force of attraction between 
two fixed changes and two fixed parallel current carrying wires. According to the 
relativity principle these experiments must yield the same values for eo and fiQ in 
all inertial frames. Thus, the speed of light must be the same in all inertial frames. 
In fact, any disturbance which does not require a medium to propagate through 
must appear to travel at the same velocity in all inertial frames, otherwise we 
could differentiate inertial frames using the apparent propagation speed of the 
disturbance, which would violate the relativity principle. 

2.2 The Lorentz transform 

Consider two Cartesian frames S(x,y, z,t) and S'(x' ,y' , z' ,t') in the standard 
configuration in which S' moves in the x-direction of S with uniform velocity v 
and the corresponding axes of S and S' remain parallel throughout the motion, 
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having coincided at t = t' = 0. It is assumed that the same units of distance 
and time are adopted in both frames. Suppose that an event {e.g., the flashing 
of a light-bulb, or the collision of two point particles) has coordinates (x, y, z, t) 
relative to S and (x\ y' , z', t') relative to S' . The "common sense" relationship 
between these two sets of coordinates is given by the Galilean transformation: 



x' = x-vt, (2.2a) 

y' = y, (2.2b) 

z = z, (2.2c) 

t' = t. (2.2d) 



This transformation is tried and tested and provides a very accurate description 
of our everyday experience. Nevertheless, it must be wrong! Consider a light wave 
which propagates along the £-axis in S with velocity c. According to the Galilean 
transformation the apparent speed of propagation in S' is c — v, which violates 
the relativity principle. Can we construct a new transformation which makes the 
velocity of light invariant between different inertial frames, in accordance with the 
relativity principle, but reduces to the Galilean transformation at low velocities, 
in accordance with our everyday experience? 

Consider an event P and a neighbouring event Q whose coordinates differ 
from those of P by dx, dy, dz, dt in S and by dx' , dy' : dz', dt' in S' . Suppose 
that at the event P a flash of light is emitted and that Q is an event in which 
some particle in space is illuminated by the flash. In accordance with the laws 
of light-propagation, and the invariance of the velocity of light between different 
inertial frames, an observer in S will find that 

dx 2 + dy 2 + dz 2 - c 2 dt 2 = (2.3) 

for dt > 0, and an observer in S' will find that 

dx' 2 + dy' 2 + dz' 2 - c 2 dt' 2 = (2.4) 

for dt' > 0. Any event near P whose coordinates satisfy either (2.3) or (2.4) is 
illuminated by the flash from P and therefore its coordinates must satisfy both 
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(2.3) and (2.4). Now, no matter what form the transformation between coordi- 
nates in the two inertial frames takes, the transformation between differentials at 
any fixed event P is linear and homogeneous. In other words, if 

x? = F{x,y,z,t), (2.5) 

where F is a general function, then 

dx' = — dx + — dy + — - dz + — - dt. (2.6) 
ox oy oz ot 

It follows that 

dx' 2 + dy' 2 + dz' 2 - c 2 dt' 2 = adx 2 + bdy 2 + c dz 2 + ddt 2 + g dx dt + hdy dt 

+k dz dt + I dy dz + m dx dz + n dx dy, (2-7) 

where a, 6, c, etc. are functions of x, y, z, and t. We know that the right-hand 
side of the above expression vanishes for all real values of the differentials which 
satisfy Eq. (2.3). It follows that the right-hand side is a multiple of the quadratic 
in Eq. (2.3); i.e., 

dx 2 + dy 2 + dz 2 - c 2 dt 2 = K{dx' 2 + dy' 2 + dz' 2 - c 2 dt' 2 ), (2.8) 

where K is a function of x, y, z, and t. [We can prove this by substituting into 
Eq. (2.7) the following obvious zeros of the quadratic in Eq. (2.3): (±1,0,0, 1), 
(0, ±1, 0, 1), (0, 0, ±1, 1), (0, l/y/2, l/y/2, 1), (l/v/2, 0, l/y/2, 1), (1/ A 1/V% 0, 1): 
and solving the resulting conditions on the coefficients.] Note that K at P is also 
independent of the choice of standard coordinates in S and S' . Since the frames 
are Euclidian, the values of dx 2 + dy 2 + dz 2 and dx' 2 + dy' 2 + dz' 2 relevant to P 
and Q are independent of the choice of axes. Furthermore, the values of dt 2 and 
dt' 2 are independent of the choice of the origins of time. Thus, without affecting 
the value of K at P we can choose coordinates such that P = (0, 0, 0, 0) in both 
S and S' . Since the orientations of the axes in S and S' are, at present, arbitrary, 
and since inertial frames are isotropic, the relation of S and S' relative to each 
other, to the event P, and to the locus of possible events Q is now completely 
symmetric. Thus, we can write 



dx 
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+ dy' 2 + dz' 2 - c 2 dt' 2 = K(dx 2 + dy 2 + dz 2 - c 2 dt 2 ), (2.9) 
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in addition to Eq. (2.8). It follows that K = ±1. K = — 1 can be dismissed 
immediately, since the intervals dx 2 +dy 2 +dz 2 -c 2 dt 2 and dx' 2 +dy' 2 +dz' 2 -c 2 dt' 2 
must coincide exactly when there is no motion of S' relative to S. Thus, 

dx' 2 + dy' 2 + dz' 2 - c 2 dt' 2 = dx 2 + dy 2 + dz 2 - c 2 dt 2 . (2.10) 

Equation (2.10) implies that the transformation equations between primed and 
unprimed coordinates must be linear. The proof of this statement is postponed 
until later. 

The linearity of the transformation allows the coordinate axes in the two 
frames to be orientated so as to give the standard configuration mentioned earlier. 
Consider a fixed plane in S with the equation Ix + my + nz + p = 0. In S' this 
becomes, say, l(aix' -\-biy' -\-ciz' + d\t' -\-ei)-\-m(a2x' -\- ■ ■ •) + nia^x' ' + ■ ■ -)+p = 0, 
which represents a moving plane unless ld\ + mdi + nd^ = 0. That is, unless the 
normal vector to the plane (I, m, n) in S is perpendicular to the vector (d\, e^, ^3)- 
All such planes intersect in lines which are fixed in both S and S' , and which 
are parallel to the vector (di,d2,d^) in S. These lines must correspond to the 
direction of relative motion of the frames. By symmetry, two such frames which 
are orthogonal in S must also be orthogonal in S' . This allows the choice of two 
common coordinate planes. 

Under a linear transformation the finite coordinate differences satisfy the same 
transformation equations as the differentials. It follows from Eq. (2.10), assum- 
ing that the events (0, 0, 0, 0) coincide in both frames, that for any event with 
coordinates (x,y,z,t) in S and (x' , y', z' , t') in S' the following relation holds: 

2 1 2 1 2 2.2 /2 1 /2 1 /2 2./2 /n t-t\ 

x + y + z — c t = x + y + z — c t . (2-11) 

By hypothesis, the coordinate planes y = and y' = coincide permanently. 
Thus, y = must imply y' = 0, which suggests that 

y' = Ay, (2.12) 

where A is a constant. We can reverse the directions of the x- and z-axes in 
S and S' , which has the effect of interchanging the roles of these frames. This 
procedure does not affect Eq. (2.12), but by symmetry we also have 

y = Ay'. (2.13) 
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It is clear that A = ±1. The negative sign can again be dismissed, since y = y' 
when there is no motion between S and S' . The argument for z is similar. Thus, 
we have 

y' = y, (2.14a) 



z' = z, (2.14b) 



as in the Galilean transformation. 



Equations (2.11) and (2.14) yield 



x 2 - c 2 t 2 = x' 2 - c 2 t' 2 . (2.15) 

Since, x' = must imply x = vt, we can write 

x' = B(x-vt), (2.16) 

where B is a constant (possibly depending on v). It follows from the previous 
two equations that 

t' = Cx + Dt, (2.17) 

where C and D are constants (possibly depending on v). Substituting Eqs. (2.16) 
and (2.17) into Eq. (2.15) and comparing the coefficients of x 2 , xt, and t 2 , we 
obtain 

b=d = id-ttW" - (2 - 18a) 

c = w^iw - (2 - 18b) 

We must choose the positive sign in order to ensure that x' — > x as vjc — > 0. 
Thus, collecting our results, the transformation between coordinates in S and S' 
is given by 

, x — vt . . 

X = (l-^/c 2 ) 1 ^' (2 - 19a) 

y' = y, (2.19b) 

z' = z, (2.19c) 
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This is the famous Lorentz transform. It ensures that the velocity of light is 
invariant between different inertial frames, and also reduces to the more familiar 
Galilean transform in the limit v/c <C 1. We can solve Eqs. (2.19) for x, y, z, and 
t to obtain the inverse Lorentz transform: 



x 



x' + vt' /n - 

(2.20a) 



(l_ v 2/ c 2)l/2> 

y = y\ (2.20b) 

= z\ (2.20c) 



z 



t' + vx'/c 2 , n n 1X 

* = (1-,2/cW (2 - 2M) 

Clearly, the inverse transform is equivalent to a Lorentz transform in which the 
velocity of the moving frame is — v along the £-axis instead of +i>. 



2.3 Transformation of velocities 

Consider two frames S and S' in the standard configuration. Let u be the velocity 
of a particle in S. What is the particle velocity in 5"? The components of the 
velocity are 

dx 

Ul = — , (2.21a) 



u 2 = % (2.21b) 

dz 

u 3 = -jr, (2.21c) 



and, similarly, the components of u' are 



u\ = ^, (2.22a) 



dt' 

«i = %, (2.22b) 
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u 



3 



dz' 
~dt> 



(2.22c) 



Now we can write Eqs. (2.19) in the form dx' = ^(dx — vdt), dy' = dy, dz' = dz, 
and dt' = ^(dt — vdx/c 2 ), where 

7 = (1 _ J /c iy ft ^ 

is the well known Lorentz factor. If we substitute these differentials into Eqs. (2.22) 
and make use of Eqs. (2.21), we obtain the transformation formulae 

< = i ( 2 - 24a ) 
1 — u\v jc A 



u 



u 2 



.1 

7(1 — uiv/c 2 ) ' 



(2.24b) 



= — U3 — . (2.24c) 

3 7 (l-wiv/c 2 ) v y 

As in the transformation of coordinates, we can obtain the inverse transform by 

interchanging primed and unprimed symbols and replacing -\-v with —v. Thus, 

1 + u 1 v/c A 

U2 = / 2V ( 2 - 25b ) 

7(1 + u 1 v/c A ) 
u' 

U3 = M-l ' I 2V ( 2 - 25c ) 
7(1 + u 1 v/c A ) 



Equations (2.25) can be regarded as giving the resultant, u = (^1,^2,^3), 
of two velocities, v = (^,0, 0) and u' = (u' 1: u' 2 ,u' 3 ), and are therefore usually 
referred to as the relativistic velocity addition formulae. The following relation 
between the magnitudes u = {u 2 + u 2 2 + u^) 1 / 2 and u' = (w^ 2 + u' 2 + w^ 2 ) 1 / 2 
of the velocities is easily demonstrated: 



2 2 c'(c'-u")(c'-v') 



c u = 1 2 , -IT* -■ ( 2 - 26 ) 

[c A + w^v)^ 



13 



If u' < c and v < c the right-hand side is positive, implying that u < c. In other 
words, the resultant of two subluminal velocities is another subluminal velocity. 
It is evident that a particle can never attain the velocity of light relative to a given 
inertial frame, no matter how many subluminal velocity increments it is given. 
It follows that no inertial frame can appear to propagate with a superluminal 
velocity with respect to any other inertial frame (since we can track the origin of 
a given inertial frame using a particle which remains at rest at the origin in that 
frame). 

According to Eq. (2.26), if u' = c then u = c no matter what value v takes; 
i.e., the velocity of light is invariant between different inertial frames. Note that 
the Lorentz transform only allows one such invariant velocity (i.e., the velocity 
c which appears in Eqs. (2.19) ). Einstein's relativity principle tells us that any 
disturbance which propagates through a vacuum must appear to propagate at the 
same velocity in all inertial frames. It is now evident that all such disturbances 
must propagate at the velocity c. It follows immediately that all electromagnetic 
waves must propagate through the vacuum with this velocity, irrespective of 
their wavelength. In other words, it is impossible for there to be any dispersion 
of electromagnetic waves propagating through a vacuum. Furthermore, gravity 
waves must also propagate with the velocity c. It is convenient to label c as 
"the velocity of light" since electromagnetic radiation is, by far, the most well 
known and easily measurable type of disturbance which can propagate through 
a vacuum. 

The Lorentz transformation implies that not only the velocities of material 
particles but the velocities of propagation of all physical effects are limited by c in 
deterministic physics. Consider a general process by which an event P causes an 
event Q at a velocity U > c in some frame S. In other words, information about 
the event P appears to propagate to the event Q with a superluminal velocity. 
Let us choose coordinates such that these two events occur on the £-axis with 
(finite) time and distance separations At > and Ax > 0, respectively. The time 
separation in some other inertial frame S' is given by (see Eq. (2.19d) ) 

At' = 7 (At - vAx/c 2 ) = 7 At(l - vU/c 2 ). (2.27) 

Thus, for sufficiently large v < c we obtain At' < 0; i.e., there exist inertial frames 
in which cause and effect appear to be reversed. Of course, this is impossible in 
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deterministic physics. It follows, therefore, that information can never appear to 
propagate with a superluminal velocity in any inertial frame, otherwise causality 
would be violated. 

2.4 Tensors 

It is now convenient to briefly review the mathematics of tensors. Tensors are 
of primary importance in connection with coordinate transforms. They serve to 
isolate intrinsic geometric and physical properties from those that merely depend 
on coordinates. 

A tensor of rank r in an n-dimensional space possesses n r components which 
are, in general, functions of position in that space. A tensor of rank zero has 
one component A and is called a scalar. A tensor of rank one has n components 
(Ai, A2, • • • , A n ) and is called a vector. A tensor of rank two has n 2 components, 
which can be exhibited in matrix format. Unfortunately, there is no convenient 
way of exhibiting a higher rank tensor. Consequently, tensors are usually repre- 
sented by a typical component; e.g., we talk of the tensor A^ (rank 3) or the 
tensor A^i (rank 4), etc. The suffixes k, - • • are always understood to range 
from 1 to n. 

For reasons which will become apparent later on, we shall represent tensor 
components using both superscripts and subscripts. Thus, a typical tensor might 
look like A 13 (rank 2), or B l - (rank 2), etc. It is convenient to adopt the Einstein 
summation convention. Namely, if any suffix appears twice in a given term, once 
as a subscript and once as a superscript, a summation over that suffix (from 1 to 
n) is implied. 

To distinguish between various coordinate systems we shall use primed and 
multiply primed suffixes. A first system of coordinates (x 1 ^ 2 , • • • ,x n ) can then 
be denoted by x l , a second system (x 1 ,x 2 ,---,x n ) by x l , etc. Similarly the 
general components of a tensor in various coordinate systems are distinguished 
by their suffixes. Thus, the components of some third rank tensor are denoted 
Aijk in the x l system, by Ai>j>k> in the x l system, etc. 

When making a coordinate transformation from one set of coordinates x l to 
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another x l , it is assumed that the transformation in non-singular. In other words, 
the equations which express the x l in terms of the x l can be inverted to express 
the x l in terms of the x l . It is also assumed that the functions specifying a 
transformation are differential) le. It is convenient to write 



dx l 
dx l 
dx l 
dx^ 

Note that 



Pi , (2.28a) 
p\,. (2.28b) 



p\,p\» = (2.29a) 
PW- = 6) (2.29b) 

by the chain rule, where Sj (the Kronecker delta ) equals 1 or when i = j or 
i ^ j, respectively. 

The formal definition of a tensor is as follows: 

(i) An entity having components Aij... k in the x l system and Ai>j>... k > in the x l 
system is said to behave as a covariant tensor under the transformation x l — > x l 
if 

A vy ... k , =A ij ... k pi,p 3 jl ---p k k ,. (2.30) 

(ii) Similarly, A lJ "" k is said to behave as a contravariant tensor under x l — > x l if 

A i>j>... k > = #-ypj:...p['. (2.3i) 

(Hi) Finally, A l k '[ 3 t is said to behave as a mixed tensor (contravariant in i • • • j and 
covariant in k ■ ■ ■ I) under x l — > x l if 

When an entity is described as a tensor it is generally understood that it 
behaves as a tensor under all non-singular differentiable transformations of the 
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relevant coordinates. An entity which only behaves as a tensor under a cer- 
tain subgroup of non-singular differentiable coordinate transformations is called 
a qualified tensor, because its name is conventionally qualified by an adjective 
recalling the subgroup in question. For instance, an entity which only exhibits 
tensor behaviour under Lorentz transformations is called a Lorentz tensor or, 
more commonly, a 4-tensor. 

When applied to a tensor of rank zero (a scalar), the above definitions imply 
that A 1 = A. Thus, a scalar is a function of position only, and is independent of 
the coordinate system. A scalar is often termed an invariant. 

The main theorem of tensor calculus is as follows: 



If two tensors of the same type are equal in one coordinate system, 
then they are equal in all coordinate systems. 



The simplest example of a contravariant vector (tensor of rank one) is provided 
by the differentials of the coordinates, dx l , since 

dx e = ^$-dx { = dx i p^. (2.33) 
dx l 1 v ' 

The coordinates themselves do not behave as tensors under all coordinate trans- 
formations. However, since they transform like their differentials under linear 
homogeneous coordinate transformations, they do behave as tensors under such 
transformations. 

The simplest example of a covariant vector is provided by the gradient of a 
function of position (f> = 4>{x x , • • • , x n ). Since, if we write 

* - ^ (2-34) 

then we have 

d(f> dcj) dx l 

^ = w = d?w = ^'- (2 - 35) 

An important example of a mixed second rank tensor is provided by the 
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Kronecker delta introduced previously. Since, 



£t i' 3 i' 3 

°jPiPj> =PjPj> 




(2.36) 



Tensors of the same type can be added or subtracted to form new tensors. 
Thus, if Aij and Bij are tensors, then dj = Aij ± Bij is a tensor of the same 
type. Note that the sum of tensors at different points in space is not a tensor if 
the p's are position dependent. However, under linear coordinate transformations 
the p's are constant, so the sum of tensors at different points behaves as a tensor 
under this particular type of coordinate transformation. 

If A 13 and B^ are tensors, then C klm = A lJ Bj-im is a tensor of the type 
indicated by the suffixes. The process illustrated by this example is called outer 
multiplication of tensors. 

Tensors can also be combined by inner multiplication, which implies at least 
one dummy suffix link. Thus, C kl = A 13 B^i and Ck = A lJ Bijk are tensors of the 
type indicated by the suffixes. 

Finally, tensors can be formed by contraction from tensors of higher rank. 
Thus, if A k 3 lm is a tensor then C kl = A\ kl and Ck = A l ki - are tensors of the type 
indicated by the suffixes. The most important type of contraction occurs when 
no free suffixes remain: the result is a scalar. Thus, A\ is a scalar provided that 
A 3 is a tensor. 

Although we cannot usefully divide tensors, one by another, an entity like 
C lJ in the equation A 3 = C lJ Bi, where A 1 and B{ are tensors, can be formally 
regarded as the quotient of A 1 and Bi. This gives the name to a particularly 
useful rule for recognizing tensors, the quotient rule. This rule states that if a set 
of components, when combined by a given type of multiplication with all tensors 
of a given type yields a tensor, then the set is itself a tensor. In other words, if 
the product A 1 = C lJ Bj transforms like a tensor for all tensors Bi then it follows 
that C lJ is a tensor. 



Let 



dx m 



= A 



i—3 

k---l,m ' 



(2.37) 
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Then if A l k [[ J t is a tensor, differentiation of the general tensor transformation 
(2.32) yields 

4rS, m > = 4::Upf • --PjA ■ • -A vz> + Pi + ^ + ■ ■ ■ , (2.38) 

where P\,P2, etc., are terms involving derivatives of the p's. Clearly, A^.'.'^ is 
not a tensor under a general coordinate transformation. However, under a linear 

coordinate transformation (p's constant) A^',''^, m , behaves as a tensor of the type 
indicated by the suffixes, since the Pi,P2, etc., all vanish. Similarly, all higher 
partial derivatives, 

• • dA 1 '"^ 

A limn = dx m k d ^n ( 2 - 39 ) 

etc., also behave as tensors under linear transformations. Each partial differenti- 
ation has the effect of adding a new covariant suffix. 

So far the space to which the coordinates x l refer has been without structure. 
We can impose a structure on it by defining the distance between all pairs of 
neighbouring points by means of a metric 

ds 2 = gijdx'dxi, (2.40) 

where the gij are functions of position. We can assume that gij = gji without 
loss of generality. The above metric is analogous to, but more general than, the 
metric of Euclidian n-space, ds 2 = (dx 1 ) 2 + (dx 2 ) 2 + • — h (dx n ) 2 . A space whose 
structure is determined by a metric of the type (2.40) is called Riemannian. Since 
ds 2 is invariant, it follows from a simple extension of the quotient rule that gij 
must be a tensor. It is called the metric tensor. 

The elements of the inverse of the matrix are denoted by g lJ . These 
elements are uniquely defined by the equations 

g ij gjk = sl (2.41) 

It is easily seen that the g lJ constitute the elements of a contravariant tensor. This 
tensor is said to be conjugate to g^. The conjugate metric tensor is symmetric 
(i.e., g lJ = g Jl ) just like the metric tensor itself. 
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The tensors gij and g tJ allow us to introduce the important operations of 
raising and lowering suffixes. These operations consist of forming inner products 
of a given tensor with or g l K For example, given a contravariant vector A 1 , 
we define its covariant components Ai by the equation 

A i = g ij A 3 . (2.42) 

Conversely, given a covariant vector Bi, we can define its contravariant compo- 
nents B l by the equations 

B^g^Bj. (2.43) 

More generally, we can raise or lower any or all of the free suffixes of any given 
tensor. Thus, if Aij is a tensor we define A l j by the equation 

A i j =g i ^A pj . (2.44) 

Note that once the operations of raising and lowering suffixes has been defined 
the order of raised suffixes relative to lowered suffixes becomes significant. 

By analogy with Euclidian space we define the squared magnitude (A) 2 of a 
vector A 1 with respect to the metric gijdx l dx 3 by the equation 

(A) 2 = gijA'A* = AiA\ (2.45) 

A vector A 1 termed a null vector if (A) 2 = 0. Two vectors A 1 and B l are said to 
be orthogonal if their inner product vanishes, i.e., if 

g ij A i B j = AiB 1 = A l Bi = 0. (2.46) 

Finally, let us consider differentiation with respect to distance s. The tangent 
vector dx l /ds to a given curve in space is a contravariant tensor, since 



dx 1 ' dx l dx l dx l ,. 
~ds~ = ~dx J ~ds~ = ~ds~ Pi ' ( " ^ 

The derivative d(A 1 "' 3 '&.../) / 'ds of some tensor with respect to distance is not, in 
general, a tensor, since 

ds = A fe -^^7' (2 - 48) 
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and, as we have seen, the first factor on the right is not generally a tensor. 
However, under linear transformations it behaves as a tensor, so under linear 
transformations the derivative of a tensor with respect to distance behaves as a 
tensor of the same type. 

2.5 Transformations 

In this course we shall only concern ourselves with coordinate transformations 
which transform an inertial frame into another inertial frame. This limits us to 
four classes of transformations: displacements of the coordinate axes, rotations 
of the coordinate axes, parity reversals (i.e., x,y,z — > —x, —y, —z), and Lorentz 
transformations. All of these transformations possess group properties. As a re- 
minder, the requirements for an abstract multiplicative group are: 

(i) The product of two elements is an element of the group. 

(ii) The associative law (ab)c = a(bc) holds. 

(Hi) There is a unit element e satisfying ae = ea = a for all a. 

(iv) Each element a possesses an inverse a -1 such that a _1 a = cm -1 = e. 

Consider Lorentz transformations (in the standard configuration). It is easily 
demonstrated that the resultant of two successive Lorentz transformations, with 
velocities v\ and i>2, respectively, is equivalent to a Lorentz transformation with 
velocity v = (vi+V2)/(l + ViV2/c 2 ). Lorentz transformations obviously satisfy the 
associative law. The unit element of the transformation group is just a Lorentz 
transformation with v = 0. Finally, the inverse of a Lorentz transformation with 
velocity v is a transformation with velocity —v. We can use similar arguments 
to show that translations, rotations, parity inversions, and general Lorentz trans- 
formations (i.e., transformations between frames which are not in the standard 
configuration) also possess group properties. 

If we think carefully, we can see that the group properties of the above men- 
tioned transformations are a direct consequence of the relativity principle. Let 
us again consider Lorentz transformations. Suppose that we have three iner- 
tial frames S, S' , and S" . According to (i), if we can get from S to S' by a 
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Lorentz transformation, and from S' to S" by a second Lorentz transformation, 
then it must always be possible to go directly from S to S" by means of a third 
Lorentz transformation. Suppose, for the sake of argument, that we can find 
three frames for which this is not the case. In this situation, the frame S' could 
be distinguished from the frame S" because it is possible to make a direct Lorentz 
transformation from S to the former frame, but not to the latter. This violates 
the relativity principle and, therefore, this situation can never arise. We can use 
a similar argument to demonstrate that a Lorentz transformation must possess 
an inverse. The associative law and the requirement that a unit element exists 
are trivially satisfied. 



2.6 The physical significance of tensors 



One of the central tenets of physics is that experiments should be repeatable. 
In other words, if somebody performs a physical experiment today and obtains 
a certain result, then somebody else performing the same experiment next week 
ought to obtain the same result, within the experimental errors. Presumably, in 
performing these hypothetical experiments both experimentalists find it neces- 
sary to set up a coordinate frame. Usually, these two frames do not coincide. 
After all, the experiments are, in general, performed in different places and at 
different times. Also, the two experimentalists are likely to orientate their coor- 
dinate axes differently. For instance, one experimentalist might align his x-axis 
with the North Star, whilst the other might align the same axis to point towards 
Mecca. Nevertheless, we still expect both experiments to yield the same result. 
What exactly do we mean by this statement? We do not mean that both ex- 
perimentalists will obtain the same numbers when they measure something. For 
instance, the numbers used to denote the position of a point (i.e., the coordinates 
of the point) are, in general, different in different coordinate frames. What we do 
expect is that any physically significant interrelation between physical quantities 
(ie., position, velocity, etc.) which appears to hold in the coordinate system of 
the first experimentalist will also appear to hold in the coordinate system of the 
second experimentalist. We usually refer to such interrelationships as "laws of 
physics." So, what we are really saying is that the laws of physics do not depend 
on our choice of coordinate system. In particular, if a law of physics is true in one 
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coordinate system then it is automatically true in every other coordinate system, 
subject to the proviso that both coordinate systems are inertial. 



Recall that tensors are geometric objects which possess the property that if 
a certain interrelationship holds between various tensors in one particular co- 
ordinate system, then the same interrelationship holds in any other coordinate 
system which is related to the first system by a certain class of transformations. 
It follows that the laws of physics are expressible as interrelationships between 
tensors. In special relativity the laws of physics are only required to exhibit 
tensor behaviour under transformations between different inertial frames; i.e., 
translations, rotations, and Lorentz transformations. This set of transformations 
forms a group known as the Poincare group. Parity inversion is a special type of 
transformation, and will be dealt with later on. In general relativity the laws of 
physics are required to exhibit tensor behaviour under all non-singular coordinate 
transformations. 

Consider Newton's first law of motion. These take the form of three differential 
equations, 



d 2 x 
d 2 z 



= fx, (2.49a) 
m^-f = f y , (2.49b) 



m 



dt 2 



/*, (2.49c) 



in a general inertial frame. However, we can also write them as a single vector 
differential equation, 

™g = /. (2.50) 

What is the advantage of the vector notation? Many people would say that it is 
just a convenient form of shorthand. However, there is another, far more impor- 
tant, advantage. Before we can accept Newton's first law of physics as a proper 
law of physics we need to convince ourselves that it is coordinate independent; 
i.e., that it also holds in coordinate frames which are related to the original frame 
via a general translation or rotation of the coordinate axes. It is indeed possible 
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to prove this, but the demonstration is rather tedious because a general rotation 
is a rather complicated transformation. A vector is a geometric object (in fact, 
it is a rank one tensor in three dimensional Euclidean space) whose three compo- 
nents transform under a general translation and rotation of the coordinate axes 
in an analogous manner to the difference in coordinates between two fixed points 
in space. This ensures that any vector equation which is true in one coordinate 
frame is also true in any other coordinate frame which is related to the original 
frame via a general rotation or translation of the axes. Thus, the main advantage 
of Eq. (2.50) is that it makes the coordinate independent nature of Newton's 
first law of motion manifestly obvious. Of course, we cannot deny that Newton's 
first law also looks simpler when it is expressed in terms of vectors. This is one 
example of a rather general feature of physical laws. Namely, when the laws of 
physics are expressed in a manner which makes their invariance under various 
transformation groups manifest then they tend to take a particularly simple form. 
In general, the larger the group of transformations the simpler the form taken by 
the laws of physics. One of the major goals of modern physics is to find the largest 
possible group of transformations under which the laws of physics are invariant, 
and then prove that when expressed in a manner which makes this invariance 
manifest these laws reduce to a single unifying principle. 

We already know how to write the laws of physics in terms of vectors and vec- 
tor fields. This means that these laws are automatically invariant under transla- 
tions and rotations. However, according to the relativity principle, there is a third 
class of transformations under which the laws of physics must also be invariant; 
namely, Lorentz transformations. There are two ways in which we could verify 
that the laws of physics are Lorentz invariant. The direct method is extremely 
tedious, since Lorentz transformations are rather complicated. An alternative 
method is to write the laws of physics in terms of geometric objects which trans- 
form as tensors under translations, rotations, and Lorentz transformations. This 
method has the advantage that it makes the Lorentz invariant nature of the laws 
of physics obvious. We also expect that when the laws of physics are written in 
manifestly Lorentz invariant form then they will look even simpler than they do 
when written just in terms of vectors. The laws of electromagnetism provide a 
particularly good illustration of this effect. 
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2.7 Space- time 



In special relativity we are only allowed to use inertial frames to assign coordi- 
nates to events. There are many different types of inertial frames. However, it 
is convenient to adhere to those with standard coordinates. That is, spatial co- 
ordinates which are right-handed rectilinear Cartesians based on a standard unit 
of length and time-scales based on a standard unit of time. We shall continue to 
assume that we are employing standard coordinates. However, from now on we 
shall make no assumptions, unless specifically stated, about the relative configu- 
ration of the two sets of spatial axes and the origins of time when dealing with 
two inertial frames. Thus, the most general transformation between two inertial 
frames consists of a Lorentz transformation in the standard configuration plus a 
translation (this includes a translation in time) and a rotation of the coordinate 
axes. The resulting transformation is called a general Lorentz transformation, 
as opposed to a Lorentz transformation in the standard configuration which will 
henceforth be termed a standard Lorentz transformation. 

In Section 2.2 we proved quite generally that corresponding differentials in 
two inertial frames S and S' satisfy the relation 

dx 2 + dy 2 + dz 2 - c 2 dt 2 = dx' 2 + dy' 2 + dz' 2 - c 2 dt' 2 . (2.51) 

Thus, we expect this relation to remain invariant under a general Lorentz trans- 
formation. Since such a transformation is linear it follows that 

(x 2 - x x ) 2 + (y 2 - yi) 2 + (z 2 - z x ) 2 - c 2 (t 2 - h) 2 = 
(4 - x[) 2 + (y' 2 - y[) 2 + (4 - z[) 2 - c 2 (t' 2 - t[) 2 , (2.52) 

where [x\, y\, z\, ti) and (#2, 2/2, ^2, £2) are the coordinates of any two events in 
S and the primed symbols denote the corresponding coordinates in S' . It is 
convenient to write 

-dx 2 - dy 2 - dz 2 + c 2 dt 2 = ds 2 , (2.53) 

and 

-(x 2 - zi) 2 - (3/2 - yif - (z 2 - z x ) 2 + c 2 (t 2 - h) 2 = s 2 . (2.54) 

The differential ds, or the finite number s, defined by these equations is called the 
interval between the corresponding events. Equations (2.51) and (2.52) express 
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the fact that the interval between two events is invariant, in the sense that it has 
the same value in all inertial frames. In other words, the interval between two 
events is invariant under a general Lorentz transformation. 

Let us consider entities defined in terms of four variables 

x 1 = x, x 2 = y, x 3 = z, x 4 = ct, (2.55) 

and which transform as tensors (see Eqs. (2.30) (2.32) ) under a general Lorentz 
transformation. From now on such entities will be referred to as J^-tensors. 

Tensor analysis cannot proceed very far without the introduction of a non- 
singular tensor g^, the so-called fundamental tensor, which is used to define 
the operations of raising and lowering suffixes (see Eqs. (2.42) (2.44)). The 
fundamental tensor is usually introduced using a metric ds 2 = gij dx l dx^ , where 
ds 2 is a differential invariant. We have already come across such an invariant, 
namely 

ds 2 = -dx 2 - dy 2 - dz 2 + c 2 dt 2 

= -(dx 1 ) 2 - (dx 2 ) 2 - (dx 3 ) 2 + (dx 4 ) 2 

= g^dx^dx", (2.56) 

where fi, v run from 1 to 4. Note that the use of Greek suffixes is conventional 
in 4-tensor theory. Roman suffixes are reserved for tensors in three dimensional 
Euclidian space, so-called 3-tensors. The 4-tensor has the components gn = 
922 = 933 = I? ^44 = 1, and 

9[ii> — when \i ^ v , in all permissible coordinate 
frames. From now on g^ u , as defined above, is adopted as the fundamental 
tensor for 4-tensors. g^ can be thought of as the metric tensor of the "space" 
whose points are the events (x 1 , x 2 , x 3 , x 4 ) . This "space" is usually referred to 
as space-time, for obvious reasons. Note that space-time cannot be regarded as a 
straightforward generalization of Euclidian 3-space to four dimensions, with time 
as the fourth dimension. The distribution of signs in the metric ensures that the 
time coordinate x 4 is not on the same footing as the three space coordinates. 
Thus, space-time has a non-isotropic nature which is quite unlike Euclidian space 
with its positive definite metric. According to the relativity principle, all physical 
laws are expressible as interrelationships between 4-tensors in space-time. 
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A tensor of rank one is called a J^-vector. We shall also have occasion to use 
ordinary vectors in three dimensional Euclidian space. Such vectors are called 
3-vectors and are conventionally represented by boldface symbols. We shall use 
the Latin suffixes k, etc. to denote the components of a 3- vector; these suffixes 
are understood to range from 1 to 3. Thus, u = u l = dx l /dt denotes a velocity 
vector. For 3-vectors we shall use the notation u l = ui interchangeably; i.e., the 
level of the suffix has no physical significance. 

When tensor transformations from one frame to another actually have to be 
computed, we shall usually find it possible to choose coordinates in the standard 
configuration, so that the standard Lorentz transform applies. Under it, any con- 
travariant 4- vector T M transforms according to the same scheme as the difference 
in coordinates between two points in space-time. It follows that 



T 1 ' = 

rp3' 

J> 4 ' 



7(2* -/3T 4 ), 

1 

rp3 

1 

7(2* -/3T 1 ), 



(2.57a) 
(2.57b) 
(2.57c) 
(2.57d) 



where (3 = v/c. Higher rank 4-tensors transform according to the rules (2.30) 
(2.32). The transformation coefficients take the form 
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(2.58a) 



(2.58b) 



Often the first three components of a 4- vector coincide with the components of 
a 3- vector. For example, the x 1 , x 2 , x 3 in ) are the components 

of r, the position 3-vector of the point at which the event occurs. In such cases 
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we adopt the notation exemplified by W 1 = (r, ct). The covariant form of such a 
vector is simply R^ = (— r,ct). The squared magnitude of the vector is (-R) 2 = 
R^RV = -r 2 + cH 2 . The inner product g^R»Q v = R^ of R» with a similar 
vector = (q, k) is given by R^Q^ = — r q + ct k. The vectors R^ and Q M are 
said to be orthogonal if R^Q 1 ^ = 0. 

Since a general Lorentz transformation is a linear transformation, the partial 
derivative of a 4-tensor is also a 4-tensor; 

\ = A V °. (2.59) 

Clearly, a general 4-tensor acquires an extra covariant index after partial differ- 
entiation with respect to the contravariant coordinate . It is helpful to define 
a covariant derivative operator 

where 

dpA" = A"V (2.61) 
There is a corresponding contravariant derivative operator 

^4rKs)' (2 - 62) 

where 

d»A va ee fr )T . (2.63) 
The 4-divergence of a 4- vector = (A, ^4°) is the invariant 

1 BA° 

d»A„ = dp A* = V-A + - — . (2.64) 

The four dimensional Laplacian operator, or d'Alembertian, is equivalent to the 
invariant contraction 

n = B^ = -^ + ~. (2.65) 
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Recall that we still need to prove (from Section 2.2) that the invariance of the 
differential metric, 

ds 2 = dx' 2 + dy' 2 + dz' 2 - c 2 dt' 2 = dx 2 + dy 2 + dz 2 - c 2 dt 2 , (2.66) 

between two general inertial frames implies that the coordinate transformation 
between such frames is necessarily linear. To put it another way, we need to 
demonstrate that a transformation which transforms a metric g^ v dx lx dx v with 
constant coefficients into a metric g^ v i dx^ dx v with constant coefficients must 
be linear. Now 

9^ = Qu'v'vtfPu ■ ( 2 -67) 
Differentiating with respect to x a we get 



9»>v> V%V V V + 9* v > p£ Plo = 0, (2.68) 
where 

etc. Interchanging the indices /i and a yields 

9** V^vi + 9** vivi» = 0. (2.70) 
Interchanging the indices v and a gives 

9*'* PvPv'v + 9** vivia = 0, (2.71) 

where the indices \j! and v' have been interchanged in the first term. It follows 
from Eqs. (2.68), (2.70), and (2.71) that 

9^'vi^ =0. (2.72) 

Multiplication by p v a , yields 

9 i i>v> V*iaViv V o> = 9»>a> V^a = °" ( 2 -73) 

Finally, multiplication by g v a gives 

V.'/>;l=i = 0' (2-74) 

This proves that the coefficients p 1 ^ are constants and, hence, that the transfor- 
mation is linear. 
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2.8 Proper time 



It is often helpful to write the invariant differential interval ds 2 in the form 

ds 2 = c 2 dr 2 . (2.75) 

The quantity dr is called the proper time. It follows that 

, 9 dx 2 + dy 2 + dz 2 , 9 , n „ N 

dr 2 = 9 + tft 2 . (2.76) 

Consider a series of events on the world-line of some material particle. If the 
particle has speed u then 



dr 2 = dr 

implying that 



dx 2 + dy 2 + dz 2 
cHt 2 + 



u 2 



^(1-^-1, (2.77) 



J = 7H- (2-78) 

It is clear that dt = dr in the particle's rest frame. Thus, dr corresponds to the 
time difference between two neighbouring events on the particle's world-line, as 
measured by a clock attached to the particle (hence, the name "proper time"). 
According to Eq. (2.78), the particle's clock appears to run slow, by a factor 7 (it), 
in an inertial frame in which the particle is moving with velocity u. This is the 
celebrated time dilation effect. 

Let us consider how a small 4-dimensional volume element in space-time trans- 
forms under a general Lorentz transformation. We have 

d 4 x' = Jd 4 x, (2.79) 

where 

_ djx 1 '^ 2 '^ 3 '^ 4 ') 

J ~ d(x\x 2 , X 3,X*) 

is the Jacobian of the transformation; i.e., the determinant of the transformation 
matrix p^ . A general Lorentz transformation is made up of a standard Lorentz 



30 



transformation plus a displacement and a rotation. Thus, the transformation ma- 
trix is the product of that for a standard Lorentz transformation, a translation, 
and a rotation. It follows that the Jacobian of a general Lorentz transformation 
is the product of that for a standard Lorentz transformation, a translation, and a 
rotation. It is well known that the Jacobian of the latter two transformations is 
unity, since they are both volume preserving transformations which do not affect 
time. Likewise, it is easily seen (e.g., by taking the determinant of the transfor- 
mation matrix (2.58a) ) that the Jacobian of a standard Lorentz transformation 
is also unity. It follows that 

dV = d 4 x (2.81) 

for a general Lorentz transformation. In other words, a general Lorentz transfor- 
mation preserves the volume of space-time. Since time is dilated by a factor 7 in 
a moving frame, the volume of space-time can only be preserved if the volume of 
ordinary 3-space is reduced by the same factor. As is well known, this is achieved 
by length contraction along the direction of motion by a factor 7. 



2.9 4-velocity and 4-acceleration 

We have seen that the quantity dx^/ds transforms as a 4- vector under a general 
Lorentz transformation (see Eq. (2.47) ). Since ds oc dr it follows that 

dr^ 

U fi = ^- (2.82) 
dr 

also transforms as a 4-vector. This quantity is known as the 4-velocity. Likewise, 
the quantity 

AU dU ^ /~ooN 

~ ~dr^~ = ~dr~ (2 ' 83 > 
is a 4-vector, and is called the 4- acceleration. 

For events along the world-line of a particle traveling with 3-velocity u we 
have 
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where use has been made of Eq. (2.78). This gives the relationship between a par- 
ticle's 3-velocity and its 4-velocity. The relationship between the 3-acceleration 
and the 4-acceleration is less straightforward. We have 



A» = 



dU» 



dr 



= 7 



dU» 



dt 




(2.85) 



where a = du/dt is the 3-acceleration. In the rest frame of the particle = (0, c) 
and A» = (a, 0). It follows that 



(note that U^A^ is an invariant quantity). In other words, the 4-acceleration of 
a particle is always orthogonal to its 4-velocity. 

2.10 The current density 4- vector 

Let us now consider the laws of electromagnetism. We wish to demonstrate that 
these laws are compatible with the relativity principle. In order to achieve this it 
is necessary for us to make an assumption about the transformation properties of 
electric charge. The assumption which we shall make, which is well substantiated 
experimentally, is that charge, unlike mass, is invariant. That is, the charge car- 
ried by a given particle has the same measure in all inertial frames. In particular, 
the charge carried by a particle does not vary with the particle's velocity. 

Let us suppose, following Lorentz, that all charge is made up of elementary 
particles, each carrying the invariant amount e. Suppose that n is the number 
density of such charges at some given point and time, moving with velocity it, 
as observed in a frame S. Let no be the number density of charges in the frame 
So in which the charges are momentarily at rest. As is well known, a volume of 
measure V in S has measure ^(u) V in So (because of length contraction). Since 
observers in both frames must agree on how many particles are contained in the 
volume, and, hence, on how much charge it contains, it follows that n = ^(u) no- 
If p = en and po = eno are the charge densities in S and /So, respectively, then 



U U A» = 



(2.86) 



P = l(u)p Q . 



(2.87) 
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The quantity po is called the proper density and is obviously Lorentz invariant. 

Suppose that are the coordinates of the moving charge in S. The current 
density J^-vector is constructed as follows: 

dr^ 

J» = Po-r- = PoU» . (2.88) 
or 

Thus, 

J» = p 0l {u){u,c) = {j,pc), (2.89) 

where j = pu is the current density 3-vector. Clearly, charge density and current 
density transform as the time-like and space-like components of the same 4- vector. 

Consider the invariant 4-divergence of J M : 

<V" = V-j + ^. (2.90) 

We know that one of the caveats of Maxwell's equations is the charge conservation 
law 

^ + V-i = 0. (2.91) 

It is clear that this expression can be rewritten in the manifestly Lorentz invariant 
form 

d„ J» = 0. (2.92) 

This equation tells us that there are no net sources or sinks of electric charge in 
nature; i.e., electric charge is neither created nor destroyed. 



2.11 The potential 4- vector 

There are many ways of writing the laws of electromagnetism. However, the most 
obviously Lorentz invariant way is to write them in terms of the vector and scalar 
potentials. When written in this fashion, Maxwell's equations reduce to 

2 £V = w 

)a = /i j, (2.93b) 
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where 4> is the scalar potential and A is the vector potential. Note that the differ- 
ential operator appearing in these equations is the Lorentz invariant d' Alembertian, 
defined in Eq. (2.65). The above pair of equations can be rewritten in the form 

n<f> = JH, (2.94a) 
ceo 

UcA = —. (2.94b) 
ceo 

Maxwell's equations can be written in Lorentz invariant form provided that the 
entity 

<P» = {cA,(f)) (2.95) 

transforms as a contravariant 4- vector. This entity is known as the potential 
4-vector. It follows from Eqs. (2.89), (2.94), and (2.95) that 

= . (2.96) 

ce 

Thus, the field equations which govern classical electromagnetism can all be 
summed up in a single 4-vector equation. 



2.12 Gauge invariance 

The electric and magnetic fields are obtained from the vector and scalar potentials 
according to the prescription 

dA 

E = -V0--^, (2.97a) 
B = VAi. (2.97b) 

These fields are important because they determine the electromagnetic forces 
exerted on charged particles. Note that the above prescription does not uniquely 
determine the two potentials. It is possible to make the following transformation, 
known as a gauge transformation, which leaves the fields unaltered: 

4> -> <£+^> ( 2 - 98a ) 
A -> A-W, (2.98b) 
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where ip(r,t) is a general scalar field. It is necessary to adopt some form of con- 
vention, generally known as a gauge condition, to fully specify the two potentials. 
In fact, there is only one gauge condition which is consistent with Eqs. (2.93). 
This is the Lorentz gauge condition, 

if + V.A = 0. (2.99) 

Note that this condition can be written in the Lorentz invariant form 

= 0. (2.100) 

This implies that if the Lorentz gauge holds in one particular inertial frame then 
it automatically holds in all other inertial frames. A general gauge transformation 
can be written 

$n -xf^ + cd'V- (2.101) 

Note that even after the Lorentz gauge has been adopted the potentials are un- 
determined to a gauge transformation using a scalar field ip which satisfies the 
sourceless wave equation 

= 0. (2.102) 

However, if we adopt "sensible" boundary conditions in both space and time then 
the only solution to the above equation is ip = 0. 



2.13 Solution of the inhomogeneous wave equation 

Equations (2.93) all have the general form 

□tf(r,t) =g(r,t). (2.103) 

Can we find a unique solution to the above equation? Let us assume that the 
source function g(r,t) can be expressed as a Fourier integral 

/oo 
gLU (r)e- iojt duj. (2.104) 
-oo 
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The inverse transform is 

1 f°° 

9«{r) = — J g(r,t)e lb *dt. (2.105) 
Similarly, we may write the general potential ip(r,t) as a Fourier integral 

/oo 

V> w (r)e- iwt dw, (2.106) 
-oo 

with the corresponding inverse 

i r°° 

^W = — y tf(r,t)e ,w *dt. (2.107) 

Fourier transformation of Eq. (2.103) yields 

(V 2 + fc 2 ty w = -&,, (2.108) 

where k = uj/c. 

The above equation, which reduces to Poisson's equation in the limit k — > 0, 
and is called Helmholtz's equation, is linear, so we may attempt a Green's function 
method of solution. Let us try to find a function G w (r,r') such that 

(V 2 + k 2 )G u (ry) = -5(r - r'). (2.109) 

The general solution is then 

Mr) = J 9 u {r')G u {ry)dV'. (2.110) 

The "sensible" spatial boundary conditions which we impose are that G u (r, r') — >■ 
as |r — r'\ — > oo. In other words, the field goes to zero a long way from the 
source. Since the system we are solving is spherically symmetric about the point 
r' it is plausible that the Green's function itself is spherically symmetric. It 
follows that 



R dR? 



+ k z Gu = -5{R), (2.111) 
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where R = r — r' and R = \R\. The most general solution to the above equation 
in the region R > is 1 



G U (R) 



A, 



ikR 



+ Be 



-ikR 



AtiR 



(2.112) 



We know that in the limit k — > the Green's function for Helmholtz's equation 
must tend towards that for Poisson's equation, which is 



G U (R) = 



AttR 



(2.113) 



This is only the case if A + B = 1. 

Reconstructing i/)(r,t) from Eqs. (2.106), (2.110), and (2.112), we obtain 




4tt J J R 
It follows from Eq. (2.104) that 



Ae -\ u {t-R/c) + Be -iu(t+R/c) 



dwdV. (2.114) 



A f g(r',t-R/c) JT „ , B f g{r',t + R/c) _ 



dK' + — 

47T 



dV" 



(2.115) 



Now, the real space Green's function for the inhomogeneous wave equation 
(2.103) satisfies 

nG{r,r , ;t,t , ) = 6{r-r , )6{t-t'). (2.116) 
Hence, the most general solution of this equation takes the form 

i>(r,t) = J J ' g{r'J)G(ry;tJ)dV'M. (2.117) 

Comparing Eqs. (2.115) and (2.117) we obtain 

G{r, r';t, t') = A G (+) (r, r'; t, t') + B G (_) (r, r'; *'), (2.118) 

x In principle, A = A(u;) and B = B(u>), with A + B = 1. However, later on we shall 
demonstrate that B = 0, otherwise causality is violated. It follows that A = 1. Thus, it is 
legitimate to assume, for the moment, that A and S are constants. 
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where 

G ( ± ) (r , r . it , = tl^M, (2 .n9) 

47r \r — r'\ 

and A + B=l. 

The real space Green's function specifies the response of the system to a point 
source at position r' which appears momentarily at time t' . According to the 
retarded Green's function G^ the response consists of a spherical wave, centred 
on r' , which propagates forward in time. In order for the wave to reach position 
r at time t it must have been emitted from the source at r' at the retarded time 
t r = t — \r — r'l/c. According to the advanced Green's function G^ the response 
consists of a spherical wave, centred on r' , which propagates backward in time. 
Clearly, the advanced potential is not consistent with our ideas about causality, 
which demand that an effect can never precede its cause in time. Thus, the 
Green's function which is consistent with our experience is 

G(r,r'i«,0 = G«(r,r'i«,f ) = ^l^JZ^Ml . ( 2 . 120 ) 

47T \r — r | 

We are able to find solutions of the inhomogeneous wave equation (2.103) which 
propagate backward in time because this equation is time symmetric (i.e., it is 
invariant under the transformation t — > —t). 

In conclusion, the most general solution of the inhomogeneous wave equation 
(2.103) which satisfies sensible boundary conditions at infinity and is consistent 
with causality is 

This expression is sometimes written 

^ {r ' t) = I ^yh\ dV '' (2122) 

where the rectangular bracket symbol [ ] denotes that the terms inside the bracket 
are to be evaluated at the retarded time t — \r — r'\/c. Note, in particular, from 
Eq. (2.122) that if there is no source (i.e., g(r,t) = 0) then there is no field (i.e., 
ip(r,t) = 0). But, is the above solution really unique? Unfortunately, there is a 
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weak link in our derivation, between Eqs. (2.110) and (2.111), where we assume 
that the Green's function for the Helmholtz equation subject to the boundary 
condition G w (r,r') — > as \r — r'\ — > oo is spherically symmetric. Let us try to 
fix this problem. 

With the benefit of hindsight, we can see that the Green's function 

AkR 

G U {R) = ^ (2.123) 

corresponds to the retarded solution in real space and is, therefore, the correct 
physical Green's function. The Green's function 

p -ikR 

G^R) = — (2.124) 

corresponds to the advanced solution in real space and must, therefore, be re- 
jected. We can select the retarded Green's function by imposing the following 
boundary condition at infinity 

( dC \ 

lim R — - -ikG = 0. (2.125) 
R^oo \dR J v ' 

This is called the Sommerfeld radiation condition; it basically ensures that sources 
radiate waves instead of absorbing them. But, does this boundary condition 
uniquely select the spherically symmetric Green's function (2.123) as the solution 
of 

(V 2 + k 2 )G U} (R, 6, if) = -S(R)? (2.126) 

Here, (R, 0, tp) are spherical polar coordinates. If it does then we can be sure that 
Eq. (2.122) represents the unique solution of the wave equation (2.103) which is 
consistent with causality. 

Let us suppose that there are two solutions of Eq. (2.126) which satisfy the 
boundary condition (2.125) and revert to the unique Green's function for Poisson's 
equation (2.113) in the limit R — > 0. Let us call these solutions u\ and U2, and 
let us form the difference w = u\ — u<i. Consider a surface Eq which is a sphere 
of arbitrarily small radius centred on the origin. Consider a second surface 
which is a sphere of arbitrarily large radius centred on the origin. Let V denote 
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the volume enclosed by these surfaces. The difference function w satisfies the 
homogeneous Helmholtz equation, 

(V 2 + /c 2 )w = 0, (2.127) 

throughout V . According to Green's theorem 



l ( M- W ^ W ) d V= (/ +/ ) I (2.128) 



where djdn denotes a derivative normal to the surface in question. It is clear 
from Eq. (2.127) that the volume integral is zero. It is also clear that the first 
surface integral is zero, since both u\ and ui must revert to the Green's function 
for Poisson's equation in the limit R — > 0. Thus, 



fx, 



' w d^_ w ,dv\ dS = (2129) 

on on J 



Equation (2.127) can be written 



d 2 (Rw) D(Rw) , 2n n 

' + ; + k 2 Rw = 0, (2.130) 



dR? R? 
where D is the spherical harmonic operator 

}_d_( . Q d\ _L d 2 

sin 9 09 OB) + sin 2 0<V 

The most general solution of Eq. (2.130) takes the form (see Section 7) 



D = — — ( sin0 — ) + — . (2.131) 



oo 

w(R,e,<p)= Y, [Cimh ( i 1 \kR) + D lm h ( i 2) (kR)]Y lrn (e^). (2.132) 

i,m=0 



Here, the C\ m and Di m are arbitrary coefficients, the Y/ m are spherical harmonics, 2 
and 

hf' 2 \ P ) = yj^W, (2.133) 



! J.D. Jackson, Classical Electrodynamics, (Wiley, 1962), p. 99 
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where Hb 2 are Hankel functions of the first and second kind. 3 It can be demon- 



n 



strated that 4 



= ,&(,-(»«/ 2 W2) £ , (2.134a) 



where 



fl-=(p) = /X e -'(o-(»+i/2)T/2) y /"', m) , (2.134b) 



(», r») = (4 " 1)(4K 9 2 L^" {2m " 1}) (2.135) 



and (n, 0) = 1. Note that the summations in Eqs. (2.314) terminate after n + 1/2 
terms. 

(2) 

The large R behaviour of the h\ is clearly inconsistent with the Sommerfeld 
radiation condition (2.125). It follows that all of the Di m in Eq. (2.132) are zero. 
The most general solution can now be expressed in the form 

^fl,«, V ) = £^f;£M, ( 2.i36) 

n=0 

where the f n (9, </?) are various weighted sums of the spherical harmonics. Substi- 
tution of this solution into the differential equation (2.130) yields 



oo 

i h R ' 

e 

n=0 



Replacing the index of summation n in the first term of the parentheses by n + 1 
we obtain 



oo 



n=0 



3 J.D. Jackson, Classical Electrodynamics, (Wiley, 1962), p. 104 

4 A. Sommerfeld, Partial differential equations in physics, (Academic Press, New York, 1964), 
p. 117 
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which gives us the recursion relation 



2i k(n + l)/ n+ i = [n(n + 1) + £]/„. (2.139) 

It follows that if fo = then all of the / n are equal to zero. 

Let us now consider the surface integral (2.129). Since we are interested in 
the limit R — > oo we can replace w by the first term of its expansion in (2.136), 
so 

L { w ^~ w *i^) ds =- 2ik / i^°i 2 dn =°> ( 2 - M °) 

where dO, is a unit of solid angle. It is clear that fo = 0. This implies that 
fi = fi = ■ ■ ■ = and, hence, that w = 0. Thus, there is only one solution of 
Eq. (2.126) which is consistent with the Sommerfeld radiation condition, and this 
is given by Eq. (2.123). We can now be sure that Eq. (2.122) is a unique solution of 
Eq. (2.103) subject to the boundary condition (2.125). This boundary condition 
basically says that infinity is an absorber of radiation but not an emitter, which 
seems entirely reasonable. 



2.14 Retarded potentials 

Equations (2.94) have the same form as the inhomogeneous wave equation (2.103), 
so we can immediately write the solutions to these equations as 

«•■■«> = ikfv=^\ dV '> (2 ' 141a) 
= rJw^-\ dV '- (2 - 141b) 

Moreover, we can be sure that these solutions are unique, subject to the reason- 
able proviso that infinity is an absorber of radiation but not an emitter. This 
is a crucially important point. Whenever the above solutions are presented in 
physics textbooks there is a tacit assumption that they are unique. After all, if 
they were not unique why should we choose to study them instead of one of the 
other possible solutions? The uniqueness of the above solutions has a physical 
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interpretation. It is clear from Eqs. (2.141) that in the absence of any charges 
and currents there are no electromagnetic fields. In other words, if we observe 
an electromagnetic field we can be certain that if we were to trace it backward 
in time we would eventually discover that it was emitted by a charge or a cur- 
rent. In proving that the solutions of Maxwell's equations are unique, and then 
finding a solution in which all waves are emitted by sources, we have effectively 
ruled out the possibility that the vacuum can be "unstable" to the production of 
electromagnetic waves without the need for any sources. 

Equations (2.141) can be combined to form the solution of the 4- vector wave 
equation (2.96), 



Here, the components of the 4-potential are evaluated at some event P in space- 
time, r is the distance of the volume element dV from P, and the square brackets 
indicate that the 4-current is to be evaluated at the retarded time; i.e., at a time 
r/c before P. 

But, does the right-hand side of Eq. (2.142) really transform as a contravariant 
4-vector? This is not a trivial question since volume integrals in 3-space are 
not, in general, Lorentz invariant due to the length contraction effect. However, 
the integral in Eq. (2.142) is not a straightforward volume integral because the 
integrand is evaluated at the retarded time. In fact, the integral is best regarded 
as an integral over events in space-time. The events which enter the integral 
are those which intersect a spherical light wave launched from the event P and 
evolved backwards in time. In other words, the events occur before the event P 
and have zero interval with respect to P. It is clear that observers in all inertial 
frames will, at least, agree on which events are to be included in the integral, 
since both the interval between events and the absolute order in which events 
occur are invariant under a general Lorentz transformation. 

We shall now demonstrate that all observers obtain the same value of dV/r 
for each elementary contribution to the integral. Suppose that S and S' are 
two inertial frames in the standard configuration. Let unprimed and primed 
symbols denote corresponding quantities in S and S f , respectively. Let us assign 
coordinates (0,0,0,0) to P and (x,y : z,ct) to the retarded event Q for which r 




(2.142) 
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and dV are evaluated. Using the standard Lorentz transformation (2.19), the 
fact that the interval between events P and Q is zero, and the fact that both t 
and t' are negative, we obtain 



r' 



/ vx \ 

ct' = -c 7 ^-— J, (2.143) 



where v is the relative velocity between frames S' and S, 7 is the Lorentz factor, 
and r = \J x 2 + y 2 + z 2 , etc. It follows that 

r' = n I -— + — J = n {1 + -cosOj , (2.144) 

where is the angle (in 3-space) subtended between the line PQ and the x-axis. 

We now know the transformation for r. What about the transformation for 
dV? We might be tempted to set dV = 'ydV, according to the usual length 
contraction rule. However, this is wrong. The contraction by a factor 7 only 
applies if the whole of the volume is measured at the same time, which is not the 
case in the present problem. Now, the dimensions of dV along the y— and z— 
axes are the same in both S and S' , according to Eqs. (2.19). For the x-dimension 
these equations give dx' = 7(cte — vdt). The extremities of dx are measured at 
times differing by dt, where 5 

dt = -— = -— cos6. (2.145) 

c c 

Thus, 

dx' = (l + V - cos#) 7 da:, (2.146) 

giving 

dV' = (l + V - cos#) ~fdV. (2.147) 

It follows from Eqs. (2.144) and (2.147) that dV'/r' = dV/r. This result will 
clearly remain valid even when S and S' are not in the standard configuration. 

5 Note that dr = dx cos 6, despite the fact that x = r cosO. This comes about because the 
volume element dV is aligned along a radius vector. 
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Thus, dV/r is an invariant and, therefore, [J^dV/r is a contravariant 4- 
vector. For linear transformations, such as a general Lorentz transformation, the 
result of adding 4-tensors evaluated at different 4-points is itself a 4-tensor. It 
follows that the right-hand side of Eq. (2.142) is a contravariant 4- vector. Thus, 
this 4-vector equation can be properly regarded as the solution to the 4-vector 
wave equation (2.96). 




2.15 Tensors and pseudo-tensors 

The totally antisymmetric fourth rank tensor is defined 

for a, (3, 7, S any even permutation of 1, 2, 3, 4 
for a, (3, 7, S any odd permutation of 1, 2, 3, 4 
otherwise 

(2.148) 

The components of this tensor are invariant under a general Lorentz transforma- 
tion, since 

^vivUiti = " a ' (3 ' 1 ' 6 ' w'i = eaW5 '> ( 2 - 149 ) 

where \p£ | denotes the determinant of the transformation matrix, or the Jacobian 
of the transformation, which we have already established is unity for a general 
Lorentz transformation. We can also define a totally antisymmetric third rank 
tensor e ufe which stands in the same relation to 3-space as e a ^ ld does to space- 
time. It is easily demonstrated that the elements of e ufe are invariant under a 
general translation or rotation of the coordinate axes. The totally antisymmetric 
third rank tensor is used to define the cross product of two 3- vectors, 

(a A by = e ijk aj b k , (2.150) 

and the curl of a 3-vector field, 

(V A A)* = e^ k Tj^j- (2-151) 
The following two rules are often useful in deriving vector identities 

e ijk e iab = 5i5t-5>5l (2.152a) 
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e ijk e ijb = 25$. (2.152b) 



Up to now we have restricted ourselves to three basic types of coordinate trans- 
formation; namely, translations, rotations, and standard Lorentz transformations. 
An arbitrary combination of these three transformations constitutes a general 
Lorentz transformation. Let us now extend our investigations to include a fourth 
type of transformation known as a parity inversion; i.e., x,y,z,—> —x,—y,—z. 
A reflection is a combination of a parity inversion and a rotation. As is eas- 
ily demonstrated, the Jacobian of a parity inversion is —1, unlike a translation, 
rotation, or standard Lorentz transformation, which all possess Jacobians of +1. 

The prototype of all 3-vectors is the difference in coordinates between two 
points in space, r. Likewise, the prototype of all 4-vectors is the difference in 
coordinates between two events in space-time, ffl 1 = (r,ct). It is not difficult to 
appreciate that both of these objects are invariant under a parity transformation 
(in the sense that they correspond to the same geometric object before and after 
the transformation). It follows that any 3- or 4-tensor which is directly related 
to r and R^, respectively, is also invariant under a parity inversion. Such ten- 
sors include the distance between two points in 3-space, the interval between two 
points in space-time, 3- velocity, 3-acceleration, 4-velocity, 4-acceleration, and the 
metric tensor. Tensors which exhibit tensor behaviour under translations, rota- 
tions, special Lorentz transformations, and are invariant under parity inversions, 
are termed proper tensors, or sometimes polar tensors. Since electric charge is 
clearly invariant under such transformations (i.e., it is a proper scalar) it follows 
that 3-current and 4-current are proper vectors. It is also clear from Eq. (2.96) 
that the scalar potential, the vector potential, and the potential 4-vector, are 
proper tensors. 

It follows from Eq. (2.149) that e a ^ s — > — e a ^^ 5 under a parity inversion. 
Tensors like this, which exhibit tensor behaviour under translations, rotations, 
and special Lorentz transformations, but are not invariant under parity inver- 
sions (in the sense that they correspond to different geometric objects before and 
after the transformation), are called pseudo-tensors, or sometimes axial tensors. 
Equations (2.150) and (2.151) imply that the cross product of two proper vectors 
is a pseudo- vector, and the curl of a proper vector field is a pseudo- vector field. 
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One particularly simple way of performing a parity transformation is to ex- 
change positive and negative numbers on the three Cartesian axes. A proper 
vector is unaffected by such a procedure (i.e., its magnitude and direction are the 
same before and after). On the other hand, a pseudo- vector ends up pointing in 
the opposite direction after the axes are renumbered. 

What is the fundamental difference between proper tensors and pseudo-tensors? 
The answer is that all pseudo-tensors are defined according to a handedness con- 
vention. For instance, the cross product between two vectors is conventionally 
defined according to a right-hand rule. The only reason for this is that the major- 
ity of human beings are right-handed. Presumably, if the opposite were true then 
cross products etc. would be defined according to a left-hand rule and would, 
therefore, take minus their conventional values. The totally antisymmetric ten- 
sor is the prototype pseudo-tensor and is, of course, conventionally defined with 
respect to a right-handed spatial coordinate system. A parity inversion converts 
left into right and vice versa and, thereby, effectively swaps left- and right-handed 
conventions. 

The use of conventions in physics is perfectly acceptable provided that we 
recognize that they are conventions and are consistent in their use. It follows that 
laws of physics cannot contain mixtures of tensors and pseudo-tensors, otherwise 
they would depend our choice of handedness convention. 6 

Let us now consider electric and magnetic fields. We know that 

dA 

E = -V<f>-—, (2.153a) 

B = VAi. (2.153b) 

We have already seen that the scalar and the vector potential are proper scalars 
and vectors, respectively. It follows that E is a proper vector but that B is a 
pseudo- vector (since it is the curl of a proper vector) . In order to fully appreci- 
ate the difference between electric and magnetic fields let us consider a thought 

6 Here, we are assuming that the laws of physics do not possess an intrinsic handedness. This 
is certainly the case for mechanics and electromagnetism. However, the weak interaction does 
possess an intrinsic handedness; i.e., it is fundamentally different in a parity inverted universe. 
So, the equations governing the weak interaction do actually contain mixtures of tensors and 
pseudo-tensors. 
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experiment first proposed by Richard Feynman. Suppose that we are in radio 
contact with a race of aliens and are trying to explain to them our system of 
physics. Suppose, further, that the aliens live sufficiently far away from us that 
there are no common objects which we both can see. The question is this: could 
we unambiguously explain to these aliens our concepts of electric and magnetic 
fields? We could certainly explain electric and magnetic lines of force. The former 
are the paths of charged particles (assuming that the particles are subject only 
to electric fields) and the latter can be mapped out using small test magnets. We 
could also explain how we put arrows on electric lines of force to convert them 
into electric field lines: the arrows run from positive charges (i.e., charges with 
the same sign as atomic nuclei) to negative charges. This explanation is unam- 
biguous provided that our aliens live in a matter (rather than an anti-matter) 
dominated part of the universe. But, could we explain how we put arrows on 
magnetic lines of force in order to convert them into magnetic field lines? The 
answer is no. By definition, magnetic field lines emerge from the north poles of 
permanent magnets and converge on the corresponding south poles. The defini- 
tion of the north pole of a magnet is simply that it possesses the same magnetic 
polarity as the north pole of the Earth. This is obviously a convention. In fact, 
we could redefine magnetic field lines to run from the south poles to the north 
poles of magnets without significantly altering our laws of physics (we would just 
have to replace B by — B in all our equations). In a parity inverted universe a 
north pole becomes a south pole and vice versa, so it is hardly surprising that 
B^-B. 7 



2.16 The electromagnetic field tensor 

Let us now investigate whether we can write the components of the electric and 
magnetic fields as the components of some proper 4-tensor. There is an obvious 
problem here. How can we identify the components of the magnetic field, which 
is a pseudo- vector, with any of the components of a proper-4-tensor? The former 
components transform differently under parity inversion than the latter compo- 

7 Note that it would actually be possible to unambiguously communicate to our concepts of 
left and right to our hypothetical aliens using the fact that the weak interaction possesses an 
intrinsic handedness. 
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nents. Consider a proper-3-tensor whose covariant components are written Bik, 
and which is antisymmetric: 

Bij = -Bji. (2.154) 

This immediately implies that all of the diagonal components of the tensor are 
zero. In fact, there are only three independent non-zero components of such a 
tensor. Could we, perhaps, use these components to represent the components of 
a pseudo-3- vector? Let us write 

B l = \e i3k B 3k . (2.155) 

It is clear that B l transforms as a contravariant pseudo-3- vector. It is easily seen 
that 

( B z -B y \ 

(2.156) 





I 





B z 


-By \ 


B ij = B^ = 




-B z 





B x 




\ 


By 


—B x 


/ 



where B 1 = B\ = B X: etc. In this manner, we can actually write the components 
of a pseudo-3-vector as the components of an antisymmetric proper-3-tensor. In 
particular, we can write the components of the magnetic field B in terms of an 
antisymmetric proper magnetic field 3-tensor which we shall denote B^ . 

Let us now examine Eqs. (2.153) more carefully. Recall that $ M = (— cA, (f>) 
and <9 M = (V,c _1 <9/<9£). It follows that we can write Eq. (2.153a) in the form 

Ei = -d&i + dApi. (2.157) 
Equation (2.153b) can be written 

cB l = X - e^ k cBjk = -e^ k dj$ k - (2.158) 

Let us multiply this expression by Cj a ft, making use of the identity 

e iab e^ k = SiS k b -6l5 k a . (2.159) 

We obtain 

\ (B ab ~ B ha ) = -d a $ b + d b $ a , (2.160) 
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or 



since Bij = —Bji. 



cB 



-di$j + dj$i, 



(2.161) 



Let us define a proper-4-tensor whose covariant components are given by 

F llv = d ll $ v -d v $ ll . (2.162) 

It is clear that this tensor is antisymmetric: 

Fpv = —F Vfi . (2.163) 

This implies that the tensor only possesses six independent non-zero components. 
Maybe it can be used to specify the components of E and B? 



Equations (2.157) and (2.162) yield 

F Ai = d 4 $i - di$ A = Ei 
Likewise, Eqs. (2.161) and (2.162) imply that 



Fij = di$j - dj$i = -cBij. 



Thus, 



Fi 



i4 



F< 



4i 



■Ei 



«5 



F — — F — — cB 



(2.164) 

(2.165) 

(2.166a) 
(2.166b) 



In other words, the completely space-like components of the tensor specify the 
components of the magnetic field, whereas the hybrid space and time-like com- 
ponents specify the components of the electric field. The covariant components 
of the tensor can be written 



F — 






-cB z 


+ CBy 


—F x 


+cB z 





-cB x 


~Ey 


-cB y 


+cB x 





-E z 


+E X 




+E Z 






(2.167) 



50 



Not surprisingly, F^ v is usually called the electromagnetic field tensor. The above 
expression, which appears in all standard textbooks, is very misleading. Taken 
at face value, it is simply wrong! We cannot form a proper-4-tensor from the 
components of a proper-3-vector and a pseudo-3-vector. The expression only 
makes sense if we interpret B x , say, as representing the component B23 of the 
proper magnetic field 3-tensor Bij 



The contravariant components of the electromagnetic field tensor are given 



by 



_pi4 _ _pAi _ 

pij = _ F ji = _ c b 1 \ 



or 



pnv 






-cB z 


+ CBy 


+E X 


+cB z 





-cB x 


+ Ey 


-cB y 


+cB x 





+E Z 


—E x 


~Ey 


-E z 






(2.168a) 
(2.168b) 



(2.169) 



Let us now consider two of Maxwell's equations: 

P 



V-E = 



VAB 



dE 

~dt 



(2.170a) 
(2.170b) 



Recall that the 4-current is defined J M = (j,pc). The first of these equations can 
be written 

T 4 

diE 1 = diF i4 + d 4 F 44 - 



ceo 

since F 44 = 0. The second of these equations takes the form 
e^ k dj cB k - d A E { = e^ k 5,(1/2 e kab cB ab ) + d A F Ai = 
Making use of Eq. (2.159), the above expression reduces to 



J 1 

ce 



(2.171) 



(2.172) 



- dj (cB ij - cB ji ) + d 4 F 4i = dj F ji + d 4 F 4S 3 ? 
2 ceo 



(2.173) 
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Equations (2.171) and (2.173) can be combined to give 

d„F" v = — . (2.174) 
ceo 

This equation is consistent with the equation of charge continuity, d^J^ = 0, 
because of the antisymmetry of the electromagnetic field tensor. 



2.17 The dual electromagnetic field tensor 

We have seen that it is possible to write the components of the electric and 
magnetic fields as the components of a proper-4-tensor. Is it also possible to 
write the components of these fields as the components of some pseudo-4-tensor? 
It is obvious that we cannot identify the components of the proper-3-vector E 
with any of the components of a pseudo-tensor. However, we can represent the 
components of E in terms of those of an antisymmetric pseudo-3-tensor Eij by 
writing 

E i = ±e i > k E jk . (2.175) 

It is easily demonstrated that 



/ E z -E y 



E 13 = E i:j 



\ 



-E z E x | , (2.176) 

E,, —E x 



in a right-handed coordinate system. 

Consider the dual electromagnetic field tensor G^ v , which is defined 

G» v = -e^ a(3 F a(3 . (2.177) 

This tensor is clearly an antisymmetric pseudo-4-tensor. We have 

G Ai = X - e Ai3k F jk = F jk = X - e i3k cB jk = cB\ (2.178) 
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plus 



G ij = _ ( e O-fc4 Ffc4 + e O-4fc F4fc) = e ijk FkAj 



where use has been made of = —F vyi . The above expression yields 



It follows that 



G ij = -e^Ek = --e ijk e kab E ab = —E**. 



G i4 = _ G 4i = _ cB i 

Qij = _ G Ji = _ E ij 



(2.179) 

(2.180) 

(2.181a) 
(2.181b) 



or 






-E z 


+ Ey 


-cB x 


+E Z 





—E x 


-CBy 


~Ey 


+E X 





-cB z 


+cB x 


+ CBy 


+cB z 






(2.182) 



The above expression is, again, slightly misleading, since E x stands for the compo- 
nent E 23 of the pseudo-3-tensor E % i and not for an element of the proper-3- vector 
E. Of course, in this case B x really does represent the first element of the pseudo- 
3- vector B. Note that the elements of G^ v are obtained from those of F^ v by 
making the transformation c.B u — > E l i and E l — > —cB\ 

The covariant elements of the dual electromagnetic field tensor are given by 



Gi4 — —G^i — -\-cBi, 



Gi 



G 
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E, 



l J 1 



(2.183a) 
(2.183b) 



or 



Gfiv — 






-E z 


+ Ey 


+cB x 


+E Z 





—E x 


+cB y 


~Ey 


+E X 





+cB z 


-cB x 


-CBy 


-cB z 






(2.184) 



The elements of G^ are obtained from those of F^ by making the transformation 



cBij — > Eij and E{ — > —cBi. 
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Let us now consider the two Maxwell equations 

V B = 0, (2.185a) 

VAE = - — . (2.185b) 

The first of these equations can be written 

-di cB l = diG i4 + diG 44 = 0, (2.186) 
since G 44 = 0. The second equation takes the form 

e ijk djE k = e^ k dj(l/2e kab E ab ) = djE^ = -d 4 cB\ (2.187) 

or 

djG ji + d 4 G 4i = 0. (2.188) 
Equations (2.186) and (2.188) can be combined to give 

dpG**" = 0. (2.189) 

Thus, we conclude that Maxwell's equations for the electromagnetic fields are 
equivalent to the following pair of 4-tensor equations: 

d„F^ = — , (2.190a) 

ceo 

d^ u = 0. (2.190b) 

It is obvious from the form of these equations that the laws of electromagnetism 
are invariant under translations, rotations, special Lorentz transformations, par- 
ity inversions, or any combination of these transformations. 

2.18 The transformation of electromagnetic fields 

The electromagnetic field tensor transforms according to the standard rule 

FV !v> =F ^ pl j/ p u' (2.191) 
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This easily yields the celebrated rules for transforming electromagnetic fields: 

£[l = E h (2.192a) 
B\\ = B \\, (2.192b) 
E'± = j{E ± + vAB), (2.192c) 
B' ± = 7(.Bj_ — v A E/c 2 ), (2.192d) 

where v is the relative velocity between the primed and unprimed frames, and the 
perpendicular and parallel directions are, respectively, perpendicular and parallel 
to v. 

At this stage we may conveniently note two important invariants of the elec- 
tromagnetic field. They are 

i F fiV F f "' = c 2 B 2 - E 2 , (2.193) 

and 

^G fiV F ttv = cE B. (2.194) 
The first of these quantities is a proper-scalar and the second is a pseudo-scalar. 



2.19 The potential due to a moving charge 

Suppose that a particle carrying a charge e moves with uniform velocity u through 
a frame S. Let us evaluate the vector potential A and the scalar potential 4> due 
to this charge at a given event P in S. 

Let us choose coordinates in S so that P = (0,0,0,0) and u = (it, 0,0). Let 
S' be that frame in the standard configuration with respect to S in which the 
charge is (permanently) at rest, say at the point (x',y',z'). In S' the potential 
at P is the usual potential due to a stationary charge 

A' = 0, (2.195a) 
f = 4^7' ( 2 ' 195b ) 
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where r' = \] x' 2 + y' 2 + z' 2 . Let us now transform these equations directly into 
the frame S. Since = (cA, (ft) is a contravariant 4- vector, its components 
transform according to the standard rules (2.57). Thus, 

cA 1 = 7 (cA' 1 + V) = -^7, (2.196a) 
V c / 47reo c r 

cA 2 = cA' 2 = 0, (2.196b) 
cA 3 = cA'g = 0, (2.196c) 

since (3 = — u/c in this case. It remains to express the quantity r' in terms of 
quantities measured in S. The most physically meaningful way of doing this is 
to express r' in terms of retarded values in S. Consider the retarded event at 
the charge for which, by definition, r' = —ct' and r = —ct. Using the standard 
Lorentz transformation (2.19) we find that 

r' = -ct' = -crf(t - ux/c 2 ) = rj(l + u r /c), (2.197) 

where u r = ux/r = r-u/r denotes the radial velocity of the change in S. We can 
now rewrite Eqs. (2.196) in the form 

A = ^ M (2 . 198a) 
4/T [r + r ■u/c\ 

4> = / f - 1 n , (2.198b) 
47reo [r + r -tt/cj 



where the square brackets, as usual, indicate that the enclosed quantities must 
be retarded. For a uniformly moving charge the retardation of u is, of course, 
superfluous. However, since 



& = [ — dV, (2.199) 

it is clear that the potentials depend only on the (retarded) velocity of the charge 
and not on its acceleration. Consequently, the expressions (2.198) give the correct 
potentials for an arbitrarily moving charge. They are known as the Lienard- 
Wiechert potentials. 
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2.20 The electromagnetic field due to a uniformly moving 
charge 



Although the field generated by a uniformly moving charge can be calculated 
from the expressions (2.198) for the potentials, it is simpler to calculate it rela- 
tivistically from first principles. 

Let a charge e, whose position vector at time t = is r, move with uniform 
velocity it in a frame S whose x-axis has been chosen in the direction of u. We 
require to find the field strengths E and B at the event P = (0, 0, 0, 0). Let S' be 
that frame in standard configuration with S in which the charge is permanently 
at rest. In S' the field is given by 

B' = 0, (2.200a) 

E' = (2.200b) 
47re r' 3 

This field must now be transformed into the frame S. The direct method, using 
Eqs. (2.192), is somewhat simpler here, but we shall use a somewhat indirect 
method because of its intrinsic interest. 

In order to express Eq. (2.200) in tensor form, we need the electromagnetic 
field tensor F^ v on the left, and the position 4-vector R? = (r, ct) and the scalar 
e/(47reor' ) on the right. (We regard r' as an invariant for all observers.) To 
get a vanishing magnetic field in S' we multiply on the right by the 4-velocity 

= 7(u)(m, c), thus tentatively arriving at the equation 

F» v = — 5 U ti R v . (2.201) 

Recall that F 4z = — E l and F l i = —cB l K This equation cannot be correct, 
because the antisymmetric tensor can only be equated to another antisym- 
metric tensor. Consequently, we try the equation 

F» v = — T (U^R" - U U R»). (2.202) 

This is found to give the correct field at P in S' as long as refers to any event 
at the charge, no matter which. It only remains to interpret (2.202) in S. It is 
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convenient to choose for that event at the charge at which t = (not the 
retarded event). Thus, 

F jk = _ cB jk = e jrk _ ^ 

4/reo cr' 



giving 



or 



Bi = \ e ijk B> k = --^3 7 («) e^u'r*, (2.204) 



2 47r r 



Likewise, 



or 



B = -^4wAf. (2.205) 

F 4i = -E l = 67 „ r\ (2.206) 

# = r. (2.207) 

Lastly, we must find an expression for r' 3 in terms of quantities measured in S 
at time t = 0. If t' is the corresponding time in S' at the charge, we have 

r' 2 = r 2 + cV 2 = r 2 + = r * ^ + tV ^ (2 _ 2Q8) 

Thus, 

^ = -7^ ^ 1 9/ 9^/9 r > (2.209a) 

47re r 3 (l + M r 2 7 2 /c 2 ) 3 / 2 v y 

B = -^l— 7 9/ 9 ^/^ Ar = ^A£. (2.209b) 

4tt r 3 (l + w r 2 7 2 /c 2 ) 3 / 2 c 2 v y 

Note that E acts in line with the point which the charge occupies at the instant 
of measurement despite the fact that, owing to the finite speed of propagation of 
all physical effects, the behaviour of the charge during a finite period before that 
instant can no longer affect the measurement. Note also that, unlike Eqs. (2.198), 
the above expressions for the fields are not valid for an arbitrarily moving charge, 
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not can they be made valid by merely using retarded values. For whereas accel- 
eration does not affect the potentials, it does affect the fields, which involve the 
derivatives of the potential. 

For low velocities, u/c — > 0, Eqs. (2.209) reduce to the well known Coulomb 
and Biot-Savart fields. However, at high velocities, 7(w) ^> 1, the fields exhibit 
some interesting behaviour. The peak electric field, which occurs at the point of 
closest approach of the charge to the observation point, becomes equal to 7 times 
its non-relativistic value. However, the duration of appreciable field strength at 
the point P is decreased. A measure of the time interval over which the field is 
appreciable is 

At~— , (2.210) 

7c 

where b is the distance of closest approach (assuming 7 ^> 1). As 7 increases, the 
peak field increases in proportion, but its duration goes in the inverse proportion. 
The time integral of the field is independent of 7. As 7 — > 00 the observer at P 
sees electric and magnetic fields which are indistinguishable from the fields of a 
pulse of plane polarized radiation propagating in the ^-direction. The direction 
of polarization is along the radius vector pointing towards the particle's actual 
position at the time of observation. 



2.21 Relativist ic particle dynamics 

Consider a particle which, in its instantaneous rest frame So, has mass mo and 
constant acceleration in the x-direction clq- Let us transform to a frame S, in the 
standard configuration with respect to So, in which the particle's instantaneous 
velocity is u. What is the value of a, the particle's instantaneous ^-acceleration, 
inS? 

The easiest way in which to answer this question is to consider the acceleration 
4-vector (see Eq. (2.85) ) 

^ = 7 (|„ + 7 „,ef). (2.211) 



59 



Using the standard transformation (2.57) for 4-vectors, we obtain 

^I w + 7 a = a , (2.212a) 



dt 

c?7 u cio 

~dt = 

It follows that 

a 

a = -g. 

The above equation can be written 



(2.212b) 



(2.213) 



/ = m o7 3 $, ( 2 - 214 ) 

where / = raoao is the constant force (in the x-direction) acting on the particle 
in £ - 

Equation (2.214) is equivalent to 

,-4=i. (2, 15 ) 

where 

m = 7mo- (2.216) 

Thus, we can account for the ever decreasing acceleration of a particle subject 
to a constant force (see Eq. (2.213) ) by supposing that the inertial mass of the 
particle increases with its velocity according to the rule (2.216). Henceforth, mo 
is termed the rest mass, and m the inertial mass. 

The rate of increase of the particle's energy E satisfies 

dE „ o du . n . 

_ = /M = mo7 3 M _. (2 . 217 ) 

This equation can be written 

dE d(mc 2 ) 

H - —dT> (2 - 218) 
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which can be integrated to yield Einstein's famous formula 

E = mc 2 . 



(2.219) 



The 3-momentum of a particle is defined 

p = mu, (2.220) 

where u is its 3- velocity. Thus, by analogy with Eq. (2.215), Newton's law of 
motion can be written 

/ = f , (2.221) 
where / is the 3-force acting on the particle. 
The 4-momentum of a particle is defined 

P" = m U» = 7 m (u, c) = (p, E/c), (2.222) 
where JJ^ is its 4-velocity. The 4-force acting on the particle obeys 

T» = —— = m A», (2.223) 

UT 

where is its 4-acceleration. It is easily demonstrated that 

^=7(/,e?)= 7 (/,£=), (2.224) 

since 

§ = /■«. (2.225) 
2.22 The force on a moving charge 

The electromagnetic 3-force acting on a charge e moving with 3-velocity u is 
given by the well known formula 

/ = e(E + uAB). (2.226) 
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When written in component form this expression becomes 

f i = e{E i + e ijk u j B k ), (2.227) 

or 

f i = e{E i + B ij u j ), (2.228) 
where use has been made of Eq. (2.155). 

Recall that the components of the E and B fields can be written in terms of 
an antisymmetric electromagnetic field tensor via 

= -F 4i = -Ei, (2.229a) 
Fij = -Fji = -cBij. (2.229b) 

Equation (2.228) can be written 

fi = -— (Fi4 U 4 + U j ), (2.230) 
7 c 

where = 7(1*, c) is the particle's 4- velocity. It is easily demonstrated that 

L± = - E u = - Ei u l = —(F 4i TP + F 44 U 4 ). (2.231) 

C C C J c 

Thus, the 4-force acting on the particle, 

^ = t(-/,^), (2-232) 

can be written in the form 

T ll = ^F llv U v . (2.233) 

The skew symmetry of the electromagnetic field tensor ensures that 

F^U» = - Fp V U»U V = 0. (2.234) 
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This is an important result since it ensures that electromagnetic fields do not 
change the rest mass of charged particles. In order to appreciate this, let us 
assume that the rest mass ttlq is not a constant. Since 

Fix = -^r^ = m A^ + —j— U^, (2.235) 

we can use the standard results U^U 1 ^ = c 2 and A^U 1 ^ = to give 

^C^ = c 2^£. (2.236) 

Thus, if rest mass is to remain an invariant it is imperative that all laws of 
physics predict 4-forces acting on particles which are orthogonal to the particles' 
4-velocities. The laws of electromagnetism pass this test. 



2.23 The electromagnetic energy tensor 

Consider a continuous volume distribution of charged matter in the presence of 
an electromagnetic field. Let there be no particles per unit proper volume (unit 
volume determined in the local rest frame), each carrying a charge e. Consider 
an inertial frame in which the 3- velocity field of the particles is u. The number 
density of the particles in this frame is n = ^(u) uq. The charge density and the 

3- current due to the particles are p = en and j = en it, respectively. Multiplying 
Eq. (2.233) by the proper number density of particles no, we obtain an expression 

U = -V ( 2 - 237 ) 

c 

for the 4-force f ^ acting on unit proper volume of the distribution due to the 
ambient electromagnetic fields. Here, we have made use of the definition J M = 
enoU 1 - 1 . It is easily demonstrated, using some of the results obtained in the 
previous section, that 

/"= ^E + jAB.^j. (2.238) 

The above expression remains valid when there are many charge species (e.g., 
electrons and ions) possessing different number density and 3- velocity fields. The 

4- vector is usually called the Lorentz force density. 
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We know that Maxwell's equations reduce to 

d„F» v = — , (2.239a) 
ce 

d„G^ = 0, (2.239b) 

where is the electromagnetic field tensor and is its dual. As is easily 
verified, Eq. (2.239b) can also be written in the form 

d„F ua + d v F all + daFp,, = 0. (2.240) 

Equations (2.237) and (2.239a) can be combined to give 

U = eoF„ a d^°. (2.241) 

This expression can also be written 

U = eo(d li (F>« r F va ) - F^d^F ua ). (2.242) 

Now, 

F» a d^F ua = ±F^(d^F ua + d a F^), (2.243) 

where use has been made of the antisymmetry of the electromagnetic field tensor. 
It follows from Eq. (2.240) that 

F» a d^F ua = - X -F»° d v F aix = \d v {F^F^). (2.244) 



Thus, 



fu = e (d^F"* F va ) - \d v (F^F^ a ) ) . (2.245) 



The above expression can also be written 

U = ~d^T\, (2.246) 

where 



T\ = e ( F* a F av + \^ v {F pa F pa ) ) (2.247) 
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is called the electromagnetic energy tensor. Note that v is a proper-4-tensor. 
It follows from Eqs. (2.167), (2.169), and (2.193) that 

Tj = €o E i E j + !^-6 i j UeoE k E k + ?^Y (2.248a) 

/Jo 3 2 V Vo J 

T\ = _ T 4, = t l3k EjB k ^ (2.248b) 

Hoc 

Ti = UeoE k E k + ^-). (2.248c) 



Equation (2.246) can also be written 

r = ~d^\ (2.249) 
where T^ v is a symmetric tensor whose elements are 

r>i r>j i / d2 \ 

T ij = -e E i E^ +S ij - le E 2 + — , (2.250a) 

A*o 2 V A*o / 

T i4 = T 4i = (E_/\BY_^ (2.250b) 

Hoc x 

^ = K e °£ 2 + — V (2.250c) 
^ V /Jo / 

Consider the time-like component of Eq. (2.249). It follows from Eq. (2.238) 
that 

= -diT i4 - <9 4 T 44 . (2.251) 

c 

This equation can be rearranged to give 

dW 



dt 

where W = T 44 and e l = cT i4 , so that 



+ Ve = -Ej, (2.252) 



w =*f + & (2 - 253) 
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and 

e = (2.254) 

^0 

The right-hand side of Eq. (2.252) represents the rate per unit volume at which 
energy is transferred from the electromagnetic field to charged particles. It is 
clear, therefore, that Eq. (2.252) is an energy conservation equation for the elec- 
tromagnetic field. The proper-3-scalar W can be identified as the energy density 
of the electromagnetic field, whereas the proper-3-vector e is the energy flux due 
to the electromagnetic field. The latter quantity is called the Poynting vector. 

Consider the space-like components of Eq. (2.249). It is easily demonstrated 
that these reduce to 

^ + V P = -pE - j A B, (2.255) 



where P ij = and g l = T 4i /c, or 

I 

no 2 1 fjt 



P l i = -eo&Ei + 5" - e E 2 + — , (2.256) 



and 

g=-^ = e EAB. (2.257) 

Equation (2.255) is basically a momentum conservation equation for the electro- 
magnetic field. The right-hand side represents the rate per unit volume at which 
momentum is transferred from the electromagnetic field to charged particles. The 
symmetric proper-3-tensor P ZJ is called the Maxwell stress tensor. The element 
P lJ gives the flux of electromagnetic momentum parallel to the ith axis crossing a 
surface normal to the jth axis. The proper-3-vector g represents the momentum 
density of the electromagnetic field. It is clear that the energy conservation law 
(2.252) and the momentum conservation law (2.255) can be combined together 
to give the relativistically invariant energy- momentum conservation law (2.249). 



2.24 The electromagnetic field due to an accelerated charge 

Let us calculate the electric and magnetic fields observed at position x 1 and time 
t due to a charge e whose retarded position and time are x l and t' : respectively. 
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From now on (x\t) is termed the field point and (x 1 ' ,t') is termed the source 
point. It is assumed that we are given the retarded position of the charge as a 
function of its retarded time; i.e., x l (t'). The retarded velocity and acceleration 
of the charge are 



u = 



dx l 
~dP 



and 



u = 



Ht 1 



(2.258) 



(2.259) 



respectively. The radius vector r is defined to extend from the retarded position 
of the charge to the field point, so that r l = x l — x l . (Note that this is the 
opposite convention to that adopted in Sections 2.19 and 2.20). It follows that 



dr 

dt' 



= —u. 



(2.260) 



The field and the source point variables are connected by the retardation condition 



r(x\x i ) = ( 



■■i') 



1/2 



it -If). 



(2.261) 



The potentials generated by the charge are given by the Lienard-Wiechert 
formulae 

A( x \t) = f^-, (2.262a) 

47T S 

<t>{x\t) = (2.262b) 

47TC0 S 

where s = r — r-u/c is a function both of the field point and the source point 
variables. Recall that the Lienard-Wiechert potentials are valid for accelerating 
as well as uniformly moving charges. 

The fields E and B are derived from the potentials in the usual manner 

dA 

E = -V0-— , (2.263a) 
B = VAi. (2.263b) 
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However, the components of the gradient operator V are partial derivatives at 
constant time t, and not at constant time t' . Partial differentiation with respect to 
the x l compares the potentials at neighbouring points at the same time, but these 
potential signals originate from the charge at different retarded times. Similarly, 
the partial derivative with respect to t implies constant x l , and, hence, refers to 
the comparison of the potentials at a given field point over an interval of time 
during which the retarded coordinates of the source have changed. Since we 
only know the time variation of the particle's retarded position with respect to 
t' we must transform d/dt\ x % and d/dx l \ t to expressions involving d/dt'\ x % and 
d/dx%,. 



Now, since x 1 ' is assumed to be given as a function of t' , we have 

r(z\ :/(*')) = r(x\t') = c(t-t'), (2.264) 
which is a functional relationship between x l , t, and t' . Note that 

/ dr\ ru 



dt 



(2.265) 



X 1 



It follows that 

dr / dt'\ dr dt' r-u dt' /n . 

dl = c V-m ) = Wm=-— W (2 - 266) 



where all differentiation is at constant x % . Thus, 

dt' 1 r 



dt 1 — r-u/rc s 

giving 



d r d 



dt s dt' 



(2.267) 



(2.268) 



Similarly, 



UV V V • u 

Vr = -cVt' = Vr + — Vt' = W, (2.269) 

dt' r r 

where V denotes differentiation with respect to x l at constant t' . It follows that 

Vt' = - — , (2.270) 

sc 
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so that 



v=v- * a 



scdt' 



Equation (2.263a) yields 



or 

However, 
and 

Thus, 
^E 



47reo „ Vs d u 
E = — — 



dt 



sc A 



47reo Vs r 5s 
£/ = 



r . ru ds 

U + 7, 7 7777 • 



s 2 s 3 cdt' s 2 c 2 s 3 c 2 dt' 



Vs 



r u 



r c 



ds dr rii u- u 

dt' = df ~ ~c~ + ~c~ 



s 2 r V c / s 3 c V c / 



r- u ru u 

+ — . 

r c c 



ru u ru\ r 

+ 7- 



which reduces to 

47Te 



_ 1 / rit\ / w 2 \ 1 / r/ rit\ 



A u 



Similarly, 



4tt „ „ it Vs A it r /it w <9s 
B = V A - = = A ^ 



or 



fjt e 

47T 



sc \ s s 



2 / ' 



„ r Am r 

B = = A 



Hoe 

which reduces to 



ST SC 



u u ru ru u 

- + "2 + 

s s z \ r c c 



47r„ it A r ( u 2 \ 1 r 

B = — 3- U--2 + -3— A 

/^o e s J V cr / s^c r 



( rA [( r -v)H) 
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A comparison of Eqs. (2.277) and (2.280) yields 

B= r -^. (2.281) 

rc 

Thus, the magnetic field is always perpendicular to E and the retarded radius 
vector r. Note that all terms appearing in the above formulae are retarded. 

The electric field is composed of two separate parts. The first term in Eq. (2.277) 
varies as 1/r 2 for large distances from the charge. We can think of r u = r — ru/c 
as the virtual present radius vector, i.e., the radius vector directed from the posi- 
tion the charge would occupy at time t if it had continued with uniform velocity 
from its retarded position to the field point. In terms of r u the 1/r 2 field is simply 



u 2 /& 

47TC0 S c 



^induction = ~. o fu- (2.282) 



We can rewrite the expression (2.209a) for the electric field generated by a uni- 
formly moving charge in the form 



47re r 3 (l - u 2 /c 2 + u 2 /c 2 f/ 2 r °' 

where ro is the radius vector directed from the present position of the charge at 
time t to the field point, and u r = u-ro/ro. For the case of uniform motion the 
relationship between the retarded radius vector r and the actual radius vector ro 
is simply 

r = r - - u. (2.284) 
c 

It is straightforward to demonstrate that 



s = r ^l-u 2 /c 2 + u r 2 /c 2 (2.285) 



in this case. Thus, the electric field generated by a uniformly moving charge can 
be written 

e l — u 2 /c 2 

Since r u = ro for the case of a uniformly moving charge, it is clear that Eq. (2.282) 
is equivalent to the electric field generated by a uniformly moving charge located 



E=- ^—r . 2.286 

A oO 
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at the position the charge would occupy if it had continued with uniform velocity 
from its retarded position. 



-^radiation — A _ 2 3 ' l^^o' ) 



The second term in Eq. (2.277), 

e r A (r u A u) 

47TCoC 2 s J 

is of order 1/r and, therefore, represents a radiation field in the sense of con- 
tributing to the energy flux over a large sphere. Similar considerations hold for 
the two terms of Eq. (2.280). 



2.25 The Lamor formula 

Let us transform to the inertial frame in which the charge is instantaneously at 
rest at the origin at time t = 0. In this frame u <C c, so that r u ~ r and s ~ r, for 
events which are sufficiently close to the origin at t = that the retarded charge 
still appears to travel with a velocity which is small compared to that of light. It 
follows from the previous section that 



e r A (r A it) 
E Tad ~ ^ —3 2.288a 



6 U A 7* 

B rad - "j a—a"- (2.288b) 

Let us define spherical polar coordinates whose axis points along the direction of 
instantaneous acceleration of the charge. It is easily demonstrated that 

,_, e sin . , „ „ . 

E ~ ^ w, 2.289a 

4tt€oc z r 

~ — — w. (2.289b) 

^ 47re c 3 r 

These fields are identical to those of a radiating dipole whose axis is aligned along 
the direction of instantaneous acceleration. The Poynting flux is given by 

EqBa e 2 sin 2 # 9 

e r = = 2 3 2 2.290 
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We can integrate this expression to obtain the instantaneous power radiated by 
the charge 

P=^-^u 2 - (2.291) 

This is known as Lamor's formula. Note that zero net momentum is carried off 
by the fields (2.289). 

In order to proceed further it is necessary to prove two useful theorems. The 
first theorem states that if a 4- vector field T M satisfies 

djjT 1 * = 0, (2.292) 

and if the components of T M are non-zero only in a finite spatial region, then the 
integral over 3-space, 

1 = J T 4 d 3 x, (2.293) 

is an invariant. In order to prove this theorem we need to use the 4-dimensional 
analog of Gauss's theorem, which states that 

/ d„T» d 4 x = <f T» dSp, (2.294) 
Jv Js 

where dS^ is an element of the 3-dimensional surface S bounding the 4-dimensional 
volume V. The particular volume over which the integration is performed is indi- 
cated in Fig. 1. The surfaces A and C are chosen so that the spatial components 
of T M vanish on A and C. This is always possible because it is assumed that 
the region over which the components of T M are non-zero is of finite extent. The 
surface B is chosen normal to the x 4 -axis whereas the surface D is chosen normal 
to the x 4 -axis. Here, the x^ and the x^ are coordinates in two arbitrarily chosen 
inertial frames. It follows from Eq. (2.294) that 

J T 4 dS A + j T 4 ' dS A , = 0. (2.295) 

Here, we have made use of the fact that T M dS/j, is a scalar and, therefore, has the 
same value in all inertial frames. Since dS^ = — d 3 x and dS^ = d 3 x' it follows 
that / = J T 4 d 3 x is an invariant under a Lorentz transformation. Incidentally, 
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dS 4 




Figure 1: The region of integration for proving the theorem associated with 
Eq. (2.293) 



the above argument also demonstrates that / is constant in time (just take the 
limit in which the two inertial frames are identical). 

The second theorem is an extension of the first. Suppose that a 4-tensor field 
satisfies 

d^CP" = 0, (2.296) 

and has components which are only non-zero in a finite spatial region. Let be 
a 4-vector whose coefficients do not vary with position in space-time. It follows 
that T M = A V Q* V satisfies Eq. (2.292). Therefore, 

1 = J A u Q Au d 3 x (2.297) 

is an invariant. However, we can write 

/ = A^B*, (2.298) 

where 




(2.299) 



73 



It follows from the quotient rule that if A^B^ is an invariant for arbitrary 
then must transform as a constant (in time) 4-vector. 

These two theorems enable us to convert differential conservation laws into 
integral conservation laws. For instance, in differential form the conservation of 
electrical charge is written 

d„ J** = 0. (2.300) 
However, from Eq. (2.293) this immediately implies that 

Q=\ J J 4 d 3 x = J pd 3 x (2.301) 

is an invariant. In other words, the total electrical charge contained in space is 
both constant in time and the same in all inertial frames. 

Suppose that S is the instantaneous rest frame of the charge. Let us consider 
the electromagnetic energy tensor T^ v associated with all of the radiation emitted 
by the charge between times t = and t = dt. According to Eq. (2.249) this tensor 
field satisfies 

d ll T ttv = 0, (2.302) 

apart from a region of space of measure zero in the vicinity of the charge. Fur- 
thermore, the region of space over which T^ v is non-zero is clearly finite, since we 
are only considering the fields emitted by the charge in a small time interval, and 
these fields propagate at a finite velocity. Thus, according to the second theorem 

P» = I j T 4m d 3 x (2.303) 

is a 4-vector. It follows from Section 2.23 that we can write P M = (dp,dE/c), 
where dp and dE are the total momentum and energy carried off by the radiation 
emitted between times t = and t = dt, respectively. As we have already men- 
tioned, dp = in the instantaneous rest frame S. Transforming to an arbitrary 
inertial frame S' in which the instantaneous velocity of the charge is u, we obtain 

dE' =~f(u)(dE + udp 1 ) =jdE. (2.304) 
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However, the time interval over which the radiation is emitted in S f is dt' = jdt. 
Thus, the instantaneous power radiated by the charge, 



is the same in all inertial frames. 



dE' dE 



dt' 



dt 



(2.305) 



We can make use of the fact that the power radiated by an accelerating charge 
is Lorentz invariant to find a relativistic generalization of the Lamor formula 
(2.291) which is valid in all inertial frames. We expect the power emitted by the 
charge to depend only on its 4-velocity and 4-acceleration. It follows that the 
Lamor formula can be written in Lorentz invariant form as 



67reoc 3 M 



since the 4-acceleration takes the form A 11 
frame. In a general inertial frame 



(2.306) 

(it, 0) in the instantaneous rest 



2 ( ' dl 

7 U" +7 " 



d^ 

rc ~ 1 Tt 



.2 2 



(2.307) 



where use has been made of Eq. (2.85). Furthermore, it is easily demonstrated 
that 



g?7 



q U- U , 

di = 7 V- ( 2 - 308 > 

It follows, after a little algebra, that the relativistic generalization of Lamor's 
formula takes the form 



67reo<^ 



7 



u 



(u A u)' 



(2.309) 



2.26 Radiation losses in charged particle accelerators 

Radiation losses often limit the maximum practical energy attainable in a charged 
particle accelerator. Let us investigate radiation losses in various different types 
of accelerator device using the relativistic Lamor formula. 
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For a linear accelerator the motion is one dimensional. In this case, it is easily 
demonstrated that 

^ = m 7 3 w, (2.310) 

where use has been made of Eq. (2.308), and p = ^itlqu is the particle momentum 
in the direction of acceleration (the x-direction, say). Here, mo is the particle 
rest mass. Thus, Eq. (2.309) yields 

The rate of change of momentum is equal to the force exerted on the particle in 
the ^-direction, which in turn equals the change in the energy, E, of the particle 
per unit distance. Consequently, 



e 2 fdE 



67reom 2 c 3 \dx 



(2.312) 



Thus, in a linear accelerator the radiated power depends on the external force 
acting on the particle, and not on the actual energy or momentum of the particle. 
It is obvious from the above formula that light particles such as electrons are 
going to radiate a lot more than heavier particles such as protons. The ratio of 
the power radiated to the power supplied by the external sources is 



e 2 IdE e 2 1 dE 



dE/dt 67Teom^c 3 u dx 67reoWoc 2 ttiqc 2 dx ' 



(2.313) 



since u ~ c for a highly relativistic particle. It is clear from the above ex- 
pression that the radiation losses in an electron linear accelerator are negligi- 
ble unless the gain in energy is of order m e c 2 = 0.511 MeV in a distance of 
e 2 /(67re m e c 2 ) = 1.28 x 10 -15 meters. That is 3 x 10 14 MeV/meter. Typical 
energy gains are less that 10 MeV/meter. It is, therefore, obvious that radiation 
losses are completely negligible in linear accelerators, whether for electrons or for 
other heavier particles. 

The situation is quite different in circular accelerator devices such as the 
synchrotron and the betatron. In such machines the momentum p changes rapidly 
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in direction as the particle rotates, but the change in energy per revolution is 
small. Furthermore, the direction of acceleration is always perpendicular to the 
direction of motion. It follows from Eq. (2.309) that 

O/TeoC' 3 o7reoC' 3 p z 

where p is the orbit radius. Here, use has been made of the standard result 
u = u 2 / p for circular motion. The radiative energy loss per revolution is given 
In 

SE= ^ P= (2.315) 
u oeoc- 5 p 

For highly relativistic (u ~ c) electrons this expression yields 

SE(MeV) = 8.85 x 1(T 2 ^^il) • (2.316) 

In the first electron synchrotrons, p ~ 1 meter, -E max ~ 0.3 GeV. Hence, SE max ~ 
1 keV per revolution. This was less than, but not negligible compared to, the en- 
ergy gain of a few keV per turn. For modern electron synchrotrons the limitation 
on the available radio-frequency power needed to overcome radiation losses be- 
comes a major consideration, as is clear from the E 4 dependence of the radiated 
power per turn. 



2.27 The angular distribution of radiation emitted by an 
accelerated charge 

In order to calculate the angular distribution of the energy radiated by an ac- 
celerated charge we must think carefully about what is meant by the "rate of 
radiation" of the charge. This quantity is actually the amount of energy lost by 
the charge in a retarded time interval dt' during the emission of the signal. Thus, 

P(t') = -fp, (2.317) 
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where E is the energy of the charge. The Poynting vector 

-E'rad A -Brad rp 2 T / oio\ 

e = = e cE rad -, (2.318) 

where use has been made of B Tad = (f A E vai &)/rc (see Eq. (2.281) ), represents 
the energy flux per unit actual time, t. Thus, the energy loss rate of the charge 
into a given element of solid angle dQ is 

*?gldn = - d -^A d Q = \e\ ^r 2 dQ = e c E 2 d S - r 2 <M, (2.319) 
dQ dt' 1 1 dt' rad r v J 

where use has been made of Eq. (2.267). Here, 9 and ip are angular coordinates 
used to locate the element of solid angle. It follows from Eq. (2.287) that 

dP(t') e 2 r [r A (r u A u)] 2 



dQ 167r 2 eoc 3 s 5 



(2.320) 



Consider the special case where the direction of acceleration coincides with the 
direction of motion. Let us define spherical polar coordinates whose axis points 
along this common direction. It is easily demonstrated that the above expression 
reduces to 

dP(t') e 2 u 2 sin 2 9 

dQ " 167r 2 e c 3 [1 - (u/c) cos9f K ] 

in this case. In the non-relativistic limit u/c — > the radiation pattern has 
the familiar sin 2 9 dependence of dipole radiation. In particular, the pattern is 
symmetric in the forward (9 < 7r/2) and backward (9 > tt/2) directions. However, 
as u/c — > 1 the radiation pattern becomes more and more concentrated in the 
forward direction. The angle # ma x for which the intensity is a maximum is 



9marx — COS . 

6 u/c 



^—{^l + lbu 2 /c 2 -l) 



(2.322) 



This expression yields 9 max -> 7r/2 as u/c and # ma x — > 1/(27) as u/c -> 1. 
Thus, for a highly relativistic charge the radiation is emitted in a narrow cone 
whose axis is aligned along the direction of motion. In this case, the angular 
distribution (2.321) reduces to 

dP(t') 2e 2 u 2 8 ( 7 fl) 2 
dQ ~ 7r 2 eo c 37 [l + ( 7 0) 2 ] 5 ' ( > 
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The total power radiated by the charge is obtained by integrating Eq. (2.321) 
over all solid angles. We obtain 

r sin 3 Ode f +1 (l-^)df, 

{ ] 87re oC 3 J [1 - (u/c) cosfl]* Stt^ 3 ]_ x [1 - (u/c)^ ' 1 J 

It is easily verified that 

f +1 (l-f)d» _ 4 6 

[i - m & ~ 3 7 • (2 - 325) 



Hence, 



e 



which agrees with Eq. (2.309) provided that u A u = 0. 



2.28 Synchrotron radiation 

Synchrotron radiation (z.e., radiation emitted by a charged particle constrained 
to follow a circular orbit by a magnetic field) is of particular importance in as- 
trophysics, since much of the observed radio frequency emission from supernova 
remnants and active galactic nuclei is thought to be of this type. 

Consider a charged particle moving in a circle of radius a with constant angular 
velocity ujq. Suppose that the orbit lies in the x-y plane. The radius vector 
pointing from the centre of the orbit to the retarded position of the charge is 
defined 

p = a (cos 0, sin 0,0), (2.327) 

where 4> = ujq t' is the angle subtended between this vector and the 2?-axis. The 
retarded velocity and acceleration of the charge take the form 



dp 

dt' 
du 



u = — = u (— sin 0, cos 0, 0), (2.328a) 



it = -— = — u (cos 0, sin 0, 0) , (2.328b) 

Lb L 
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where u = clojq and u = acu 2 . The observation point is chosen such that the ra- 
dius vector r, pointing from the retarded position of the charge to the observation 
point, is parallel to the y-z plane. Thus, we can write 



r = r (0, sin a, cos a), 



(2.329) 



where a is the angle subtended between this vector and the z-axis. As usual, 
we define 9 as the angle subtended between the retarded radius vector r and the 
retarded direction of motion of the charge u. It follows that 



It is easily seen that 



u-r 

cos 9 = = sin a; cos0. 

u r 



u-r = — ur sin a sin0. 



A little vector algebra shows that 

[r A (r u A ii)f = -(r-u) 2 r 2 (1 - u 2 /c 2 ) + u 2 r 4 (1 - r-u/rc) 2 , 
giving 



2 _4 



[r A (r u A u)} 2 = u l r 



u 



tan cf) cos # 



(2.330) 



(2.331) 



(2.332) 



(2.333) 



Making use of Eq. (2.320), we obtain 

dP(t') e 2 u 2 [1 - (u/c) cos0)] 2 - (1 -w 2 /c 2 )tan 2 cos 2 



dQ " 167r 2 e c 3 



[1 - (u/c) COS0] 5 



(2.334) 



It is convenient to write this result in terms of the angles a and 0, instead of 9 
and <j). After a little algebra we obtain 



dP(t') e it 2 [1 — (u 2 /c 2 )] cos 2 a + [(u/c) — sin a cos<f>]' 



dQ 167r 2 e c 3 



[1 — (u/c) sin a: cos0] 5 



(2.335) 



Let us consider the radiation pattern emitted in the plane of the orbit; i.e., 
a = 7r/2, with cos0 = cos# . It is easily seen that 



dP(t') e 2 u 2 [(u/c) -cos 0\ 



dQ 167r 2 e c 3 [l-(u/c)cos^] J 



(2.336) 
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In the non-relativistic limit the radiation pattern has a cos 2 9 dependence. Thus, 
the pattern is like that of dipole radiation where the axis is aligned along the 
instantaneous direction of acceleration. As the charge becomes more relativistic 
the radiation lobe in the forward direction (i.e., < 9 < 7r/2) becomes more 
more focused and more intense. Likewise, the radiation lobe in the backward 
direction (i.e., 7r/2 < 9 < n) becomes more diffuse. The radiation pattern has 
zero intensity at the angles 

O = cos _1 (u/c). (2.337) 

These angles demark the boundaries between the two radiation lobes. In the 
non-relativistic limit 9q = ±7r/2, so the two lobes are of equal angular extents. 
In the highly relativistic limit 9q — > ±1/7, so the forward lobe becomes highly 
concentrated about the forward direction (9 = 0). In the latter limit Eq. (2.336) 
reduces to 

dP(t') e 2 u 2 6 [l-( 7 0) 212 



dQ ~ 27r 2 e c 3 7 [1 + ( 7 0) 2 ] 5 ' ' 2 ' 338 ) 

Thus, the radiation emitted by a highly relativistic charge is focused into an 
intense beam of angular extent I/7 pointing in the instantaneous direction of 
motion. The maximum intensity of the beam scales like 7 s . 

Integration of Eq. (2.335) over all solid angle (using dQ = sin a da d</>) yields 
(not very easily!) 



^ei 74 * 2 ' (2 - 339) 

which agrees with Eq. (2.309) provided that u ii = 0. This expression can also 
be written 

^P2<Vr„ 
moc z 6 c 

where ro = e 2 /(47reo itlqc 2 ) = 2.82 x 10 -15 meters is the classical electron radius, 
rriQ is the rest mass of the charge, and (3 = u/c. If the circular motion takes place 
in an orbit of radius a perpendicular to a magnetic field B, then ujq satisfies 
uq = eB/mo'j. Thus, the radiated power is 



eB \ 2 r 



(/3 7 ) 2 , (2.341) 



ttiqc 2 3 \rriQ J c 
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and the radiated energy AE per revolution is 



AE =£?V7 4 . (2-342) 



rriQC 2 3 a 

Let us consider the frequency distribution of the emitted radiation in the 
highly relativistic limit. Suppose, for the sake of simplicity, that the observation 
point lies in the plane of the orbit (i.e., a = 7r/2). Since the radiation emitted 
by the charge is beamed very strongly in the charge's instantaneous direction 
of motion, a fixed observer is only going to see radiation (at some later time) 
when this direction points almost directly towards the point of observation. This 
occurs once every rotation period when ~ 0, assuming that ujq > 0. Note 
that the point of observation is located many orbit radii away from the centre 
of the orbit along the positive y-axis. Thus, our observer sees short periodic 
pulses of radiation from the charge. The repetition frequency of the pulses (in 
radians per second) is uq. Let us calculate the duration of each pulse. Since the 
radiation emitted by the charge is focused into a narrow beam of angular extent 
AO ~ I/7, our observer only sees radiation from the charge when (f) < AO. Thus, 
the observed pulse is emitted during a time interval At' = A0/ujq. However, the 
pulse is received in a somewhat shorter time interval 



At = 



AO 

UJ 



(l - , (2.343) 



because the charge is slightly closer to the point of observation at the end of the 
pulse than at the beginning. The above equation reduces to 

AO 1 , 
^-9 2 3' 2 - 344 

since 7 ^> 1 and AO ^1/7. The width Acu of the pulse in frequency space obeys 
Au At ~ 1. Hence, 

Au = 7 3 co . (2.345) 

In other words, the emitted frequency spectrum contains harmonics of frequency 
up to 7 3 times that of the fundamental, ujq. 
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More involved calculations 8 show that in the ultra-relativistic limit 7 ^> 1 the 
power radiated in the Ith harmonic (whose frequency is uj = Iujq) is given by 

P l = 0.52 (-^ — ^ lo 2 l 1/3 (2.346) 
\47re cy 

for 1 « k< 7 3 , and 

P ' = 2^ (ii) ^ (^) 1/2 - P [(-2/3)(V7 3 )] (2.347) 

for / ^> 7 3 . Note that the spectrum cuts off approximately at the harmonic order 
7 3 , as predicted earlier. It can also be demonstrated 9 that seven times as much 
energy is radiated with a polarization parallel to the orbital plane than with a 
perpendicular polarization. A P{uS) oc a; 1 / 3 power spectrum at low frequencies 
coupled with a high degree of polarization are the hallmarks of synchrotron radi- 
ation. In fact, these two features are used in astrophysics to identify synchrotron 
radiation from supernova remnants, active galactic nuclei, etc. 



8 L. Landau, and E. Lifshitz, The classical theory of fields, (Addison- Wesley, 1951), pp. 215 

ff. 

9 J.D. Jackson, Classical electrodynamics, (Wiley, 1962), pp. 672 ff. 
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